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Related Applications 

This application is a continuation-in-part of U.S. Patent Application No. 10/228,063, filed 
August 27, 2002, which claims priority to Application Serial No. 60/315,281, filed August 27, 
2001 , each of which is herein incorporated by reference in their entirety. 

Field of the Invention 

The present invention generally relates to the field of plant molecular biology, and more 
specifically, to the creation of plants that express a processing enzyme which provides a desired 
characteristic to the plant or products thereof. 

Background of the Invention 

Enzymes are used to process a variety of agricultural products such as wood, fruits and 
vegetables, starches, juices, and the like. Typically, processing enzymes are produced and 
recovered on an industrial scale from various sources, such as microbial fermentation (Bacillus 
a-amylase), or isolation from plants (coffee 0-galactosidase or papain from plant parts). Enzyme 
preparations are used in different processing applications by mixing the enzyme and the substrate 
under the appropriate conditions of moisture, temperature, time, and mechanical mixing such 
that the enzymatic reaction is achieved in a commercially viable manner. The methods involve 
separate steps of enzyme production, manufacture of an enzyme preparation, mixing the enzyme 
and substrate, and subjecting the mixture to the appropriate conditions to facilitate the enzymatic 
reaction. A method that reduces or eliminates the time, energy, mixing, capital expenses, and/or 
enzyme production costs, or results in improved or novel products, would be useful and 
beneficial. One example of where such improvements are needed is in the area of corn milling. 

Today corn is milled to obtain cornstarch and other corn-milling co-products such as corn 
gluten feed, corn gluten meal, and corn oil. The starch obtained from the process is often further 
processed into other products such as derivatized starches and sugars, or fermented to make a 
variety of products including alcohols or lactic acid. Processing of cornstarch often involves the 
use of enzymes, in particular, enzymes that hydrolyze and convert starch into fermentable sugars 
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or fructose (a- and gluco-amylase, a? -glucosidase, glucose isomerase, and the like). The process 
used commercially today is capital intensive as construction of very large mills is required to 
process corn on scales required for reasonable cost-effectiveness. In addition the process 
requires the separate manufacture of starch-hydrolyzing or modifying enzymes and then the 
machinery to mix the enzyme and substrate to produce the hydrolyzed starch products. 

The process of starch recovery from com grain is well known and involves a wet-milling 
process. Corn wet-milling includes the steps of steeping the corn kernel, grinding the corn 
kernel and separating the components of the kernel. The kernels are steeped in a steep tank with 
a countercurrent flow of water at about 120° F and the kernels remain in the steep tank for 24 to 
48 hours. This steepwater typically contains sulfur dioxide at a concentration of about 0.2% by 
weight. Sulfur dioxide is employed in the process to help reduce microbial growth and also to 
reduce disulfide bonds in endosperm proteins to facilitate more efficient starch-protein 
separation. Normally, about 0.59 gallons of steepwater is used per bushel of corn. The 
steepwater is considered waste and often contains undesirable levels of residual sulfur dioxide. 

The steeped kernels are then dewatered and subjected to sets of attrition type mills. The 
first set of attrition type mills rupture the kernels releasing the germ from the rest of the kernel. 
A commercial attrition type mill suitable for the wet milling business is sold under the brand 
name Bauer. Centrifugation is used to separate the germ from the rest of the kernel. A typical 
commercial centrifugation separator is the Merco centrifugal separator. Attrition mills and 
centrifugal separators are large expensive items that use energy to operate. 

In the next step of the process, the remaining kernel components including the starch, 
hull, fiber, and gluten are subjected to another set of attrition mills and passed through a set of 
wash screens to separate the fiber components from the starch and gluten (endosperm protein). 
The starch and gluten pass through the screens while the fiber does not. Centrifugation or a third 
grind followed by centrifugation is used to separate the starch from the endosperm protein. 
Centrifugation produces a starch slurry which is dewatered, then washed with fresh water and 
dried to about 1 2% moisture. The substantially pure starch is typically further processed by the 
use of enzymes. 
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The separation of starch from the other components of the grain is performed because 
removing the seed coat, embryo and endosperm proteins allows one to efficiently contact the 
starch with processing enzymes, and the resulting hydrolysis products are relatively free from 
contaminants from the other kernel components. Separation also ensures that other components 
of the grain are effectively recovered and can be subsequently sold as co-products to increase the 
revenues from the mill. 

After the starch is recovered from the wet-milling process it typically undergoes the 
processing steps of gelatinization, liquefaction and dextrinization for maltodextrin production, 
and subsequent steps of saccharification, isomerization and refining for the production of 
glucose, maltose and fructose. 

Gelatinization is employed in the hydrolysis of starch because currently available 
enzymes cannot rapidly hydrolyze crystalline starch. To make the starch available to the 
hydrolytic enzymes, the starch is typically made into a slurry with water (20-40% dry solids) and 
heated at the appropriate gelling temperature. For cornstarch this temperature is between 105- 
1 10° C. The gelatinized starch is typically very viscous and is therefore thinned in the next step 
called liquefaction. Liquefaction breaks some of the bonds between the glucose molecules of the 
starch and is accomplished enzymatically or through the use of acid. Heat-stable endo a - 
amylase enzymes are used in this step, and in the subsequent step of dextrinization. The extent 
of hydrolysis is controlled in the dextrinization step to yield hydrolysis products of the desired 
percentage of dextrose. 

Further hydrolysis of the dextrin products from the liquefaction step is carried out by a 
number of different exo-amylases and debranching enzymes, depending on the products that are 
desired. And finally if fructose is desired then immobilized glucose isomerase enzyme is 
typically employed to convert glucose into fructose. 

Dry-mill processes of making fermentable sugars (and then ethanol, for example) from 
cornstarch facilitate efficient contacting of exogenous enzymes with starch. These processes are 
less capital intensive than wet-milling but significant cost advantages are still desirable, as often 
the co-products derived from these processes are not as valuable as those derived from wet- 
milling. For example, in dry milling com, the kernel is ground into a powder to facilitate 
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efficient contact of starch by degrading enzymes. After enzyme hydrolysis of the corn flour the 
residual solids have some feed value as they contain proteins and some other components. 
Eckhoff recently described the potential for improvements and the relevant issues related to dry 
milling in a paper entitled "Fermentation and costs of fuel ethanol from corn with quick-germ 
process" ( Appl. Biochem. Biotechnol. . 94: 41 (2001)). The "quick germ" method allows for the 
separation of the oil-rich germ from the starch using a reduced steeping time. 

One example where the regulation and/or level of endogenous processing enzymes in a 
plant can result in a desirable product is sweet corn, Typical sweet corn varieties are 
distinguished from field corn varieties by the fact that sweet corn is not capable of normal levels 
of starch biosynthesis. Genetic mutations in the genes encoding enzymes involved in starch 
biosynthesis are typically employed in sweet corn varieties to limit starch biosynthesis. Such 
mutations are in the genes encoding starch synthases and ADP-glucose pyrophosphorylases 
(such as the sugary and super-sweet mutations). Fructose, glucose and sucrose,which are the 
simple sugars necessary for producing the palatable sweetness that consumers of edible fresh 
com desire, accumulate in the developing endosperm of such mutants. However, if the level of 
starch accumulation is too high, such as when the com is left to mature for too long (late harvest) 
or the corn is stored for an excessive period before it is consumed, the product loses sweetness 
and takes on a starchy taste and mouthfeel. The harvest window for sweet corn is therefore quite 
narrow, and shelf-life is limited. 

Another significant drawback to the farmer who plants sweet corn varieties is that the 

usefulness of these varieties is limited exclusively to edible food. If a farmer wanted to forego 

■ 

harvesting his sweet corn for use as edible food during seed development , the crop would be 
essentially a loss. The grain yield and quality of sweet corn is poor for two fundamental reasons. 
The first reason is that mutations in the starch biosynthesis pathway cripple the starch 
biosynthetic machinery and the grains do not fill out completely, causing the yield and quality to 
be compromised. Secondly, due to the high levels of sugars present in the grain and the inability 
to sequester these sugars as starch, the overall sink strength of the seed is reduced, which 
exacerbates the reduction of nutrient storage in the grain. The endosperms of sweet corn variety 
seeds are shrunken and collapsed, do not undergo proper desiccation, and are susceptible to 
diseases. The poor quality of the sweet com grain has further agronomic implications; as poor 
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seed viability, poor germination, seedling disease susceptibility, and poor early seedling vigor 
result from the combination of factors caused by inadequate starch accumulation. Thus, the poor 
quality issues of sweet corn impact the consumer, farmer/grower, distributor, and seed producer. 

Thus, for dry-milling, there is a need for a method which improves the efficiency of the 
process and/or increases the value of the co-products. For wet-milling, there is a need for a 
method of processing starch that does not require the equipment necessary for prolonged 
steeping, grinding, milling, and/or separating the components of the kernel. For example, there 
is a need to modify or eliminate the steeping step in wet milling as this would reduce the amount 
of waste water requiring disposal, thereby saving energy and time, and increasing mill capacity 
(kernels would spend less time in steep tanks). There is also a need to eliminate or improve the 
process of separating the starch-containing endosperm from the embryo. 

Summary of the Invention v 
The present invention is directed to self-processing plants and plant parts and methods of 
using the same. The self-processing plant and plant parts of the present invention are capable of 
expressing and activating enzyme(s) (mesophilic, thermophilic, and/or hyperthermophilic). 
Upon activation of the enzyme(s) (mesophilic, thermophilic, or hyperthermophilic) the plant or 
plant part is capable of self-processing the substrate upon which it acts to obtain the desired 
result. 

The present invention is directed to an isolated polynucleotide a) comprising SEQ ID 

NO: 2, 4, 6, 9, 19, 21, 25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 or the complement thereof, or a 

polynucleotide which hybridizes to the complement of any one of SEQ ID NO: 2, 4, 6, 9, 19, 21, 

25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 under low stringency hybridization conditions and 

encodes a polypeptide having a -amylase, pullulanase, a-glucosidase, glucose isomerase, or 

glucoamylase activity or b) encoding a polypeptide comprising SEQ ID NO: 10, 13, 14, 15, 16, 

1 8, 20 24, 26, 27, 28, 29, 30, 33, 34, 35, 36, 38, 40, 42, 44, 45, 47, 49, or 51 or an enzymatically 

active fragment thereof. Preferably, the isolated polynucleotide encodes a fusion polypeptide 

comprising a first polypeptide and a second peptide, wherein said first polypeptide has a- 

amylase, pullulanase, a-glucosidase, glucose isomerase, or glucoamylase activity. Most 

preferably, the second peptide comprises a signal sequence peptide, which may target the first 
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polypeptide to a vacuole, endoplasmic reticulum, chloroplast, starch granule, seed or cell wall of 
a plant. For example, the signal sequence may be an N-terminal signal sequence from waxy, an 
N-terminal signal sequence from y-zein, a starch binding domain, or a C-terminal starch binding 
domain. Polynucleotides that hybridize to the complement of any one of SEQ ID NO: 2, 9, or 52 
under low stringency hybridization conditions and encodes a polypeptide having a -amylase 
activity; to the complement of SEQ ED NO: 4 or 25 under low stringency hybridization 
conditions and encodes a polypeptide having pullulanase activity; to the complement of SEQ ED 
NO:6 and encodes a polypeptide having a -glucosidase activity; to the complement of of any one 
of SEQ ID NO: 19, 21 , 37, 39, 41 , or 43 under low stringency hybridization conditions and 
encodes a polypeptide having glucose isomerase activity; to the complement of any one of SEQ 
ED NO: 46, 48, 50, or 59 under low stringency hybridization conditions and encodes a 
polypeptide having glucoamylase activity are further encompassed. 

The present invention is also directed to an isolated polynucleotide a) comprising SEQ ID 
NO: 61, 63, 65, 79; 81, 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, and 1 10 or the complement 
thereof, or a polynucleotide which hybridizes to the complement of any one of SEQ ID NO: 61, 

63, 65, 79, 81, 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, or 1 10 under low stringency 
hybridization conditions and encodes a polypeptide having xylanase, cellulase, glucanase, beta 
glucosidase, esterase or phytase activity b) encoding a polypeptide comprising SEQ ED NO: 62, 

64, 66, 70, 80, 82, 84, 86, 88, 90, 92, 109, or 1 1 1 or an enzymatically active fragment thereof. 
The isolated polynucleotide may encode a fusion polypeptide comprising a first polypeptide and 
a second peptide, wherein said first polypeptide has xylanase, cellulase, glucanase, beta 
glucosidase, protease, or phytase activity. The second peptide may comprises a signal sequence 
peptide, which may target the first polypeptide to a vacuole, endoplasmic reticulum, chloroplast, 
starch granule, seed or cell wall of a plant. For example, the signal sequence may be an N- 
terminal signal sequence from waxy, an N-terminal signal sequence from y-zein, a starch binding 
domain, or a C-terminal starch binding domain. 

Exemplary xylanases provided and useful in the invention include those encoded by SEQ 
ED NO: 61, 63, or 65. An exemplary protease, namely bromelain, encoded by SEQ ID NO: 69 is 
also provided. Exemplary cellulases include cellobiohydrolase I and II as provided herein and 
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encoded by SEQ ID NO: 79,81,93, and 94. An exemplary glucanase is provides as 6GP1 
described herein encoded by SEQ ED NO: 85. Exemplary beta glucosidases include beta 
glucosidase 2 and D, as described herein and encoded by SEQ ID NO: 96 and 97. An exemplary 
esterase is also provided, namely ferulic acid esterase as encoded by SEQ ID NO:99. And, an 
exemplary phytase, Nov9X as encoded by SEQ ED NO: 109-1 12 is also provided. 

Also included are expression cassettes comprising a polynucleotide a) having SEQ ID 
NO: 2, 4, 6, 9, 19, 21, 25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 or the complement thereof, or a 
polynucleotide which hybridizes to the complement of any one of SEQ ID NO: 2, 4, 6, 9, 19, 21, 
25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 or under low stringency hybridization conditions and 
encodes an polypeptide having a -amylase, pullulanase, a -glucosidase, glucose isomerase, or 
glucoamylase activity or b) encoding a polypeptide comprising SEQ ID NO: 10, 13, 14, 15, 16, 
18, 20, 24, 26, 27, 28, 29, 30, 33, 34, 35, 36, 38, 40, 42, 44, 45, 47, 49, or 51, or an enzymatically 
active fragment thereof. The expression cassette further comprises a promoter operably linked to 
the polynucleotide, such as an inducible promoter, tissue-specific promoter, or preferably an 
endosperm-specific promoter. Preferably, the endosperm-specific promoter is a maize y-zein 
promoter or a maize ADP-gpp promoter or a maize Q promoter promoter or a rice glutelin-1 
promoter. In a preferred embodiment, the promoter comprises SEQ ED NO: 1 1 or SEQ ID NO: 
1 2 or SEQ ID NO: 67 or SEQ ID NO: 98. Moreover, in another preferred embodiment the 
polynucleotide is oriented in sense orientation relative to the promoter. The expression cassette 
of the present invention may further encode a signal sequence which is operably linked to the 
polypeptide encoded by the polynucleotide. The signal sequence preferably targets the operably 
linked polypeptide to a vacuole, endoplasmic reticulum, chloroplast, starch granule, seed or cell 
wall of a plant. The signal sequences include an N-terminal signal sequence from wary, an N- 
terminal signal sequence from y-zein, or a starch binding domain. 

Moreover, an expression cassette comprising a polynucleotide a) having SEQ ID NO: 61, 
63, 65, 79, 81, 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, and 1 10 or the complement thereof, 
or a polynucleotide which hybridizes to the complement of any one of SEQ ID NO: 61, 63, 65, 
79, 81 , 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, and 1 10 or under low stringency 
hybridization conditions and encodes an polypeptide having xylanase, cellulase, glucanase, beta 
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glucosidase, esterase or phytase activity or b) encoding a polypeptide comprising SEQ ED NO: 
62, 64, 66, 70, 80, 82, 84, 86, 88, 90, 92, 109, or 1 1 1, or an enzymatically active fragment 
thereof. The expression cassette further comprises a promoter operably linked to the 
polynucleotide, such as an inducible promoter, tissue-specific promoter, or preferably an 
endosperm-specific promoter. The endosperm-specific promoter may be a maize y-zein 
promoter or a maize ADP-gpp promoter or a maize Q promoter promoter or a rice glutelin-1 
promoter. In an embodiment, the promoter comprises SEQ ID NO: 1 1 or SEQ ID NO: 12 or 
SEQ ID NO: 67 or SEQ ID NO: 98. Moreover, in another embodiment the polynucleotide is 
oriented in sense orientation relative to the promoter. The expression cassette of the present 
invention may further encode a signal sequence which is operably linked to the polypeptide 
encoded by the polynucleotide. The signal sequence preferably targets the operably linked 
polypeptide to a vacuole, endoplasmic reticulum, chloroplast, starch granule, seed or cell wall of 
a plant. The signal sequences include an N-terminal signal sequence from waxy, an N-terminal 
signal sequence from y-zein, or a starch binding domain. 

The present invention is further directed to a vector or cell comprising the expression 
cassettes of the present invention. The cell may be selected from the group consisting of an 
Agrobacterium, a monocot cell, a dicot cell, a Liliopsida cell, a Panicoideae cell, a maize cell, 

and a cereal cell, such as a rice cell, 

Moreover, the present invention encompasses a plant stably transformed with the vectors 
of the present invention. A plant stably transformed with a vector comprising an a-amylase 
having an amino acid sequence of any of SEQ ED NO: 1, 10, 13, 14, 15, 16, 33,35 or 88 or 
encoded by a polynucleotide comprising any of SEQ ID NO: 2, 9, or 87 is provided. 

In another embodiment, a plant stably transformed with a vector comprising a pullulanase 
having an amino acid sequence of any of SEQ ID NO: 24 or 34, or encoded by a polynucleotide 
comprising any of SEQ ED NO: 4 or 25 is provided. A plant stably transformed with a vector 
comprising an a-glucosidase having an amino acid sequence of any of SEQ ID NO: 26 or 27, or 
encoded by a polynucleotide comprising SEQ ID NO:6 is further provided. A plant stably 
transformed with a vector comprising an glucose isomerase having an amino acid sequence of 
any of SEQ ED NO: 18, 20, 28, 29, 30, 38, 40, 42, or 44, or encoded by a polynucleotide 
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comprising any of SEQ ID NO: 19, 21, 37, 39, 41, or 43 is further described herein. In another 
embodiment, a plant stably transformed with a vector comprising a glucose amylase having an 
amino acid sequence of any of SEQ ID NO: 45, 47, or 49, or encoded by a polynucleotide 
comprising any of SEQ ED NO:46, 48, 50, or 59 is described. 

An additional embodiment provides a plant stably transformed with a vector comprising a 
xylanase having an amino acid sequence of any of SEQ ID NO: 62, 64 or 66, or encoded by a 
polynucleotide comprising any of SEQ ID NO: 61, 63, or 65. A plant stably transformed with a 
vector comprising a protease is also provided. The protease may be bromelain having an amino 
acid sequence as set forth in SEQ ID NO: 70, or encoded by a polynucleotide having SEQ ED 
NO: 69. In another embodiment, a plant stably transformed with a vector comprising a cellulase 
is provided. The cellulase may be a cellobiohydrolase encoded by a polynucleotide comprising 
any of SEQ ID NO: 79, 80, 81, 82, 93 or 94. 

An additional embodiment provides a plant stably transformed with a vector comprising a 
glucanase, such as an endoglucanase. The endoglucanase may be endoglucanase I which has an 
amino acid sequence as in SEQ ED NO: 84, or encoded by a polynucleotide comprising SEQ ID 
NO: 83. A plant stably transformed with a vector comprising a beta glucosidase is also provided. 
The beta glucosidase is may be beta glucosidase 2 or beta glucosidase D, which have an amino 
acid sequence set forth in SEQ ID NO: 90 or 92, or encoded by a polynucleotide having SEQ ID 
NO: 89 or 91 . In another embodiment, a plant stably transformed with a vector comprising an 
esterase is provided. The esterase may be a ferulic acid esterase encoded by a polynucleotide 
comprising SEQ ID NO: 99. 

Plant products, such as seed, fruit or grain from the stably transformed plants of the 

present invention are further provided, 

In another embodiment, the invention is directed to a transformed plant, the genome of 
which is augmented with a recombinant polynucleotide encoding at least one processing enzyme 
operably linked to a promoter sequence, the sequence of which polynucleotide is optimized for 
expression in the plant. The plant may be a monocot, such as maize or rice, or a dicot. The 
plant may be a cereal plant or a commercially grown plant. The processing enzyme is selected 
from the group consisting of an a-amylase, glucoamylase, glucose isomerase, glucanase, P- 
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amylase, a-glucosidase, isoamylase, pullulanase, neo-pullulanase, iso-pullulanase, 
amylopullulanase, cellulase, exo-l,4-p-cellobiohydrolase, exo-l,3-p-D-glucanase, p-glucosidase, 
endoglucanase, L-arabinase, a-arabinosidase, galactanase, galactosidase, mannanase, 
mannosidase, xylanase, xylosidase, protease, glucanase, xylanase, , esterase, phytase, and lipase. 
The processing enzyme is a starch-processing enzyme selected from the group consisting of a- 
amylase, glucoamylase, glucose isomerase, P-amylase, a-glucosidase, isoamylase, pullulanase, 
neo-pullulanase, iso-pullulanase, and amylopullulanase. The enzyme may be selected from or- 
amylase, glucoamylase, glucose isomerase, glucose isomerase, a-glucosidase, and pullulanase. 
The processing enzyme may be hyperthermophilic. In accordance with this aspect of the 
invention, the enzyme may be a non-starch degrading enzyme selected from the group consisting 
of protease, glucanase, xylanase, esterase, phytase, cellulase, beta glucosidase, and lipase. Such 
enzymes may be hyperthermophilic. In an embodiment, the enzyme accumulates in the vacuole, 
endoplasmic reticulum, chloroplast, starch granule, seed or cell wall of a plant. Moreover, in 
another embodiment, the genome of plant may be further augmented with a second recombinant 
polynucleotide comprising a non-hyperthermophilic enzyme. 

In another aspect of the invention, provided is a transformed plant, the genome of which 
is augmented with a recombinant polynucleotide encoding at least one processing enzyme 
selected from the group consisting of a-amylase, glucoamylase, glucose isomerase, a- 
glucosidase, pullulanase, xylanase, cellulase, protease, glucanase, beta glucosidase, esterase, 
phytase or lipase operably linked to a promoter sequence, the sequence of which polynucleotide 
is optimized for expression in the plant. 

Another embodiment is directed to a transformed maize plant, the genome of which is 
augmented with a recombinant polynucleotide encoding at least one processing enzyme selected 
from the group consisting of a-amylase, glucoamylase, glucose isomerase, a-glucosidase, 
pullulanase, xylanase, cellulase, protease, glucanase, phytase, beta glucosidase, esterase, or 
lipase operably linked to a promoter sequence, the sequence of which polynucleotide is 
optimized for expression in the maize plant. 

A transformed plant, the genome of which is augmented with a recombinant 
polynucleotide having SEQ ID NO: 83 operably linked to a promoter and to a signal sequence is 
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provided. Additionally, a transformed plant, the genome of which is augmented with a 
recombinant polynucleotide having the SEQ ID NO: 93 or 94 operably linked to a promoter and 
to a signal sequence is described. In another embodiment, a transformed plant, the genome of 
which is augmented with a recombinant polynucleotide having SEQ ID NO: 95, operably linked 
to a promoter and to a signal sequence. Moreover, a transformed plant, the genome of which is 
augmented with a recombinant polynucleotide having SEQ ID NO: 96 is described. Also 
described is a transformed plant, the genome of which is augmented with a recombinant 
polynucleotide having SEQ ED NO: 97. Also described is a transformed plant, the genome of 
which is augmented with a recombinant polypeptide having SEQ ED NO: 99. 

Products of the transformed plants are further envisioned herein. The product for 
example, include seed, fruit, or grain. The product may alternatively be the processing enzyme, 
starch or sugar. 

A plant obtained from a stably transformed plant of the present invention is further 
described. In this aspect, the plant may be a hybrid plant or an inbred plant. 

A starch composition is a further embodiment of the invention comprising at least one 
processing enzyme which is a protease, glucanase, or esterase. 

Grain is another embodiment of the invention comprising at least one processing enzyme, 
which is an a-amylase, pullulanase, a -glucosidase, glucoamylase, glucose isomerase, xylanase, 
cellulase, glucanase, beta glucosidase, esterase, protease, lipase or phytase. 

In another embodiment, a method of preparing starch granules, comprising 
treating grain which comprises at least one non-starch processing enzyme under conditions 
which activate the at least one enzyme, yielding a mixture comprising starch granules and non- 
starch degradation products, wherein the grain is obtained from a transformed plant, the genome 
of which is augmented with an expression cassette encoding the at least one enzyme; and 
separating starch granules from the mixture is provided. Therein, the enzyme may be a protease, 
glucanase, xylanase, phytase, lipase, beta glucosidase, cellulase or esterase. Moreover, the 
enzyme is preferably hyperthermophilic. The grain may be cracked grain and/or may be treated 
under low or high moisture conditions. Alternativley, the grain may treated with sulfur dioxide. 
The present invention may further comprise separating non-starch products from the mixture. 

The starch products and non-starch products obtained by this method are further described. 
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In yet another embodiment, a method to produce hypersweet corn comprising treating 
transformed corn or a part thereof, the genome of which is augmented with and expresses in the 
endosperm an expression cassette encoding at least one starch-degrading or starch-isomerizing 
enzyme, under conditions which activate the at least one enzyme so as to convert 
polysaccharides in the corn into sugar, yielding hypersweet corn is provided. The expression 
cassette may further comprises a promoter operably linked to the polynucleotide encoding the 
enzyme. The promoter may be a constitutive promoter, seed-specific promoter, or endosperm- 
specific promoter, for example. The enzyme may be hyperthermophilic and may be an a- 
amylase. The expression cassette used herein may further comprise a polynucleotide which 
encodes a signal sequence operably linked to the at least one enzyme. The signal sequence may 
direct the enzyme to the apoplast or the endoplasmic reticulum, for example. The enzyme 
comprises any one of SEQ ED NO: 13, 14, 15, 16, 33, or 35. The enzyme may also comprise 
SEQ ID NO: 87. 

In a most preferred embodiment, a method of producing hypersweet corn comprising 
treating transformed corn or a part thereof, the genome of which is augmented with and 
expresses in the endosperm an expression cassette encoding an a-amylase, under conditions 
which activate the at least one enzyme so as to convert polysaccharides in the corn into sugar, 
yielding hypersweet corn is described. The enzyme may be hyperthermophilic and the 
hyperthermophilic a-amylase may comprise the amino acid sequence of any of SEQ ID NO: 10, 
13, 14, 15, 16, 33, or 35, or an enzymatically active fragment thereof having a-amylase activity. 
The enzyme comprise SEQ ID NO: 87. 

A method to prepare a solution of hydrolyzed starch product comprising; 
treating a plant part comprising starch granules and at least one processing enzyme under 
conditions which activate the at least one enzyme thereby processing the starch granules to form 
an aqueous solution comprising hydrolyzed starch product, wherein the plant part is obtained 
from a transformed plant, the genome of which is augmented with an expression cassette 
encoding the at least one starch processing enzyme; and 

collecting the aqueous solution comprising the hydrolyzed starch product is described herein. 

The hydrolyzed starch product may comprise a dextrin, maltooligosaccharide, glucose and/or 

mixtures thereof. The enzyme may be a-amylase, a-glucosidase, glucoamylase, pullulanase, 
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amylopullulanase, glucose isomerase, or any combination thereof. Moreover, the enzyme may 
be hyperthermophilic. In another aspect, the genome of the plant part may be further augmented 
with an expression cassette encoding a non-hyperthermophilic starch processing enzyme. The 
non-hyperthermophilic starch processing enzyme may be selected from the group consisting of 
amylase, glucoamylase, a-glucosidase, pullulanase, glucose isomerase, or a combination thereof. 
In yet another aspect, the processing enzyme is preferably expressed in the endosperm. The 
plant part may be grain, and from corn, wheat, barley, rye, oat, sugar cane or rice. The at least 
one processing enzyme is operably linked to a promoter and to a signal sequence that targets the 
enzyme to the starch granule or the endoplasmic reticulum, or to the cell wall. The method may 
further comprise isolating the hydrolyzed starch product and/or fermenting the hydrolyzed starch 
product. 

In another aspect of the invention, a method of preparing hydrolyzed starch product 
comprising treating a plant part comprising starch granules and at least one starch processing 
enzyme under conditions which activate the at least one enzyme thereby processing the starch 
granules to form an aqueous solution comprising a hydrolyzed starch product, wherein the plant 
part is obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding at least one a-amylase; and 

collecting the aqueous solution comprising hydrolyzed starch product is described. The a- 
amylase may be hyperthermophilic and the hyperthermophilic a-amylase comprises the amino 
acid sequence of any of SEQ ID NO: 1, 10, 13, 14, 15, 16, 33, or 35, or an active fragment 
thereof having a-amylase activity. The expression cassette may comprise a polynucleotide 
selected from any of SEQ ED NO: 2, 9, 46, or 52, a complement thereof, or a polynucleotide that 
hybridizes to any of SEQ ID NO: 2, 9, 46, or 52 under low stringency hybridization conditions 
and encodes a polypeptide having a-amylase activity. Moreover, the invention further provides 
for the genome of the transformed plant further comprising a polynucleotide encoding a non- 
thermophilic starch-processing enzyme. Alternatively, the plant part may be treated with a non- 
hyperthermophilic starch-processing enzyme. 

The present invention is further directed to a transformed plant part comprising at least 
one starch-processing enzyme present in the cells of the plant, wherein the plant part is obtained 

from a transformed plant, the genome of which is augmented with an expression cassette 
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encoding the at least one starch processing enzyme. Preferably, the enzyme is a starch- 
processing enzyme selected from the group consisting of a-amylase, glucoamylase, glucose 
isomerase, p-amylase, a-glucosidase, isoamylase, pullulanase, neo-pullulanase, iso-pullulanase, 
and amylopullulanase. Moreover, the enzyme may be hyperthermophilic. The plant may be any 
plant, such as corn or rice for example. 

Another embodiment of the invention is a transformed plant part comprising at least one 
non-starch processing enzyme present in the cell wall or the cells of the plant, wherein the plant 
part is obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one non-starch processing enzyme or at least one non-starch 
polysaccharide processing enzyme. The enzyme may be hyperthermophilic. Moreover, the non- 
starch processing enzyme may be a protease, glucanase, xylanase, esterase, phytase, beta 
glucosidase, cellulase or lipase. The plant part can be any plant part, but preferably is an ear, 
seed, fruit, grain, stover, chaff, or bagasse. 

The present invention is also directed to transformed plant parts. For example, a 
transformed plant part comprising an a-amylase having an amino acid sequence of any of SEQ 
CD NO: 1, 10, 13, 14, 15, 16, 33, or 35, or encoded by a polynucleotide comprising any of SEQ 
ID NO: 2, 9, 46, or 52, a transformed plant part comprising an a-glucosidase having an amino 
acid sequence of any of SEQ ED NO: 5, 26 or 27, or encoded by a polynucleotide comprising 
SEQ ID NO:6, a transformed plant part comprising a glucose isomerase having the amino acid 
sequence of any one of SEQ ID NO: 28, 29, 30, 38, 40, 42, or 44, or encoded by a 
polynucleotide comprising any one of SEQ ED NO: 19, 21, 37, 39, 41, or 43, a transformed plant 
part comprising a glucoamylase having the amino acid sequence of SEQ ED NO:45 or SEQ ED 
NO:47, or SEQ ID NO:49, or encoded by a 

polynucleotide comprising any of SEQ ID NO: 46, 48, 50, or 59, and a transformed plant part 
comprising a pullulanase encoded by a polynucleotide comprising any of SEQ ID NO: 4 or 25 
are described. 

The present invention is also directed to transformed plant parts. For example, a 
transformed plant part comprising a xylanase having an amino acid sequence of any of SEQ ID 
NO: 62, 64 or 66, or encoded by a polynucleotide comprising any of SEQ ID NO: 61, 63, or 65. 
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A transformed plant part comprising a protease is also provided. The protease may be bromelain 
having an amino acid sequence as set forth in SEQ ID NO: 70, or encoded by a polynucleotide 
having SEQ ID NO: 69. In another embodiment, a transformed plant part comprising a cellulase 
is provided. The cellulase may be a cellobiohydrolase encoded by a polynucleotide comprising 
any of SEQ ID NO: 79, 80, 81, 82, 93 or 94. 

An additional embodiment provides a transformed plant part a glucanase, such as an 
endoglucanase. The endoglucanase may be endoglucanase I which has an amino acid sequence 
as in SEQ ED NO: 84, or encoded by a polynucleotide comprising SEQ ID NO: 83. A 
transformed plant part comprising a beta glucosidase is also provided. The beta glucosidase is 
may be beta glucosidase 2 or beta glucosidase D, which have an amino acid sequence set forth in 
SEQ ED NO: 90 or 92, or encoded by a polynucleotide having SEQ ID NO: 89 or 91 . In another 
embodiment, a transformed plant part comprising an esterase is provided. The esterase may be a 
ferulic acid esterase encoded by a polynucleotide comprising SEQ ID NO: 99. 

Another embodiment is a method of converting starch in the transformed plant part 
comprising activating the starch processing enzyme contained therein. The starch, dextrin, 
maltooligosaccharide or sugar produced according to this method is further described. 

The present invention further describes a method of using a transformed plant part 
comprising at least one non-starch processing enzyme in the cell wall or the cell of the plant part, 
comprising treating a transformed plant part comprising at least one non-starch polysaccharide 
processing enzyme under conditions so as to activate the at least one enzyme thereby digesting 
non-starch polysaccharide to form an aqueous solution comprising oligosaccharide and/or 
sugars, wherein the plant part is obtained from a transformed plant, the genome of which is 
augmented with an expression cassette encoding the at least one non-starch polysaccharide 
processing enzyme; and collecting the aqueous solution comprising the oligosaccharides and/or 
sugars. The non-starch polysaccharide processing enzyme may be hyperthermophilic. 

A method of using transformed seeds comprising at least one processing enzyme, 

comprising treating transformed seeds which comprise at least one protease or lipase under 

conditions so as the activate the at least one enzyme yielding an aqueous mixture comprising 

amino acids and fatty acids, wherein the seed is obtained from a transformed plant, the genome 

of which is augmented with an expression cassette encoding the at least one enzyme; and 
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collecting the aqueous mixture. The amino acids, fatty acids or both are preferably isolated. The 
at least one protease or lipase may be hyperthermophilic. 

A method to prepare ethanol comprising treating a plant part comprising at least one 
polysaccharide processing enzyme under conditions to activate the at least one enzyme thereby 
digesting polysaccharide to form oligosaccharide or fermentable sugar, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one polysaccharide processing enzyme; and incubating the 
fermentable sugar under conditions that promote the conversion of the fermentable sugar or 
oligosaccharide into ethanol. The 

plant part may be a grain, fruit, seed, stalks, wood, vegetable or root. The plant part may be 
obtained from a plant selected from the group consisting of oats, barley, wheat, berry, grapes, 
rye, com, rice, potato, sugar beet, sugar cane, pineapple, grasses and trees. 
In another preferred embodiment, the polysaccharide processing enzyme is a-amylase, 
glucoamylase, a-glucosidase, glucose isomerase, pullulanase, or a combination thereof. 

A method to prepare ethanol comprising treating a plant part comprising at least one 
enzyme selected from the group consisting of a-amylase, glucoamylase, a-glucosidase, glucose 
isomerase, or pullulanase, or a combination thereof, with heat for an amount of time and under 
conditions to activate the at least one enzyme thereby digesting polysaccharide to form 
fermentable sugar, wherein the plant part is obtained from a transformed plant, the genome of 
which is augmented with an expression cassette encoding the at least one enzyme; and 
incubating the fermentable sugar under conditions that promote the conversion of the 
fermentable sugar into ethanol is provided. The at least one enzyme may be hyperthermophilic 
or mesophilic. 

In another embodiment, a method to prepare ethanol comprising treating a plant part 
comprising at least one non-starch processing enzyme under conditions to activate the at least 
one enzyme thereby digesting non-starch polysaccharide to oligosaccharide and fermentable 
sugar, wherein the plant part is obtained from a transformed plant, the genome of which is 
augmented with an expression cassette encoding the at least one enzyme; and incubating the 
fermentable sugar under conditions that promote the conversion of the fermentable sugar into 
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ethanol is provided. The non-starch processing enzyme may be a xylanase, cellulase, glucanase, 
beta glucosidase, protease, esterase, lipase or phytase. 

A method to prepare ethanol comprising treating a plant part comprising at least one 
enzyme selected from the group consisting of a-amylase, glucoamylase, a-glucosidase, glucose 
isomerase, or pullulanase, or a combination thereof, under conditions to activate the at least one 
enzyme thereby digesting polysaccharide to form fermentable sugar, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one enzyme; and incubating the fermentable sugar under 
conditions that promote the conversion of the fermentable sugar into ethanol is further provided. 
The enzyme may be hyperthermophilic. 

Moreover, a method to produce a sweetened farinaceous food product without adding 
additional sweetener comprising treating a plant part comprising at least one starch processing 
enzyme under conditions which activate the at least one enzyme, thereby processing starch 
granules in the plant part to sugars so as to form a sweetened product, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one enzyme; and processing the sweetened product into a 
farinaceous food product is described. The farinaceous food product may be formed from the 
sweetened product and water. Moreover, the farinaceous food product may contain malt, 
flavorings, vitamins, minerals, coloring agents or any combination thereof. The at least one 
enzyme may be hyperthermophilic. The enzyme may be selected from a-amylase, a- 
glucosidase, glucoamylase, pullulanase, glucose isomerase, or any combination thereof. The 
plant may further be selected from the group consisting of soybean, rye, oats, barley, wheat, 
com, rice and sugar cane. The farinaceous food product may be a cereal food, a breakfast food, a 
ready to eat food, or a baked food. The processing may include baking, boiling, heating, 
steaming, electrical discharge or any combination thereof. 

The present invention is further directed to a method to sweeten a starch-containing 
product without adding sweetener comprising treating starch comprising at least one starch 
processing enzyme under conditions to activate the at least one enzyme thereby digesting the 
starch to form a sugar to form sweetened starch, wherein the starch is obtained from a 
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transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one enzyme; and adding the sweetened starch to a product to produce a sweetened starch 
containing product. The transformed plant may be selected from the group consisting of corn, 
soybean, rye, oats, barley, wheat, rice and sugar cane. The at least one enzyme may be 
hyperthermophilic. The at least one enzyme may be oc-amylase, a-glucosidase, glucoamylase, 
pullulanase, glucose isomerase, or any combination thereof 

A farinaceous food product and sweetened starch-containing product is provided for 

herein. 

The invention is also directed to a method to sweeten a polysaccharide-containing fruit or 
vegetable comprising treating a fruit or vegetable comprising at least one polysaccharide 
processing enzyme under conditions which activate the at least one enzyme, thereby processing 
the polysaccharide in the fruit or vegetable to form sugar, yielding a sweetened fruit or 
vegetable, wherein the fruit or vegetable is obtained from a transformed plant, the genome of 
which is augmented with an expression cassette encoding the at least one polysaccharide 
processing enzyme. The fruit or vegetable is selected from the group consisting of potato, 
tomato, banana, squash, peas, and beans. The at least one enzyme may be hyperthermophilic. 

The present invention is further directed to a method of preparing an aqueous solution 
comprising sugar comprising treating starch granules obtained from the plant part under 
conditions which activate the at least one enzyme, thereby yielding an aqueous solution 
comprising sugar. 

Another embodiment is directed to a method of preparing starch derived products from 
grain that does not involve wet or dry milling grain prior to recovery of starch-derived products 
comprising treating a plant part comprising starch granules and at least one starch processing 
enzyme under conditions which activate the at least one enzyme thereby processing the starch 
granules to form an aqueous solution comprising dextrins or sugars, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one starch processing enzyme; and collecting the aqueous solution 
comprising the starch derived product. The at least one starch processing enzyme may be 
hyperthermophilic. 
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A method of isolating an a-amylase, glucoamylase, glucose isomerase, a-glucosidase, 
and pullulanase comprising culturing a transformed plant containing the a-amylase, 
glucoamylase, glucose isomerase, a-glucosidase, or pullulanase and isolating the a-amylase, 
glucoamylase, glucose isomerase, a-glucosidase or pullulanase therefrom is further provided. 
Also provided is a method of isolating a xylanase, cellulase, glucanase, beta glucosidase, 
protease, esterase, phytase or lipase comprising culturing a transformed plant containing the 
xylanase, cellulase, glucanase, beta glucosidase, protease, esterase, phytase or lipase and 
isolating the xylanase, cellulase, glucanase, esterase, beta glucosidase, protease, esterase, phytase 
or lipase . 

A method of preparing maltodextrin comprising mixing transgenic grain with water, 
heating said mixture, separating solid from the dextrin syrup generated, and 
collecting the maltodextrin. The transgenic grain comprises at least one starch processing 
enzyme. The starch processing enzyme may be a-amylase, glucoamylase, a-glucosidase, and 
glucose isomerase. Moreover, maltodextrin produced by the method is provided as well as 
composition produced by this method. 

A method of preparing dextrins, or sugars from grain that does not involve mechanical 
disruption of the grain prior to recovery of starch-derived comprising: 

treating a plant part comprising starch granules and at least one starch processing enzyme under 
conditions which activate the at least one enzyme thereby processing the starch granules to form 
an aqueous solution comprising dextrins or sugars, wherein the plant part is obtained from a 
transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one processing enzyme; and 

collecting the aqueous solution comprising sugar and/or dextrins is provided. 

The present invention is further directed to a method of producing fermentable sugar 
comprising treating a plant part comprising starch granules and at least one starch processing 
enzyme under conditions which activate the at least one enzyme thereby processing the starch 
granules to form an aqueous solution comprising dextrins or sugars, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one processing enzyme; and 
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collecting the aqueous solution comprising the fermentable sugar. 

Moreover, a maize plant stably transformed with a vector comprising a hyperthermophlic 
a-amylase is provided herein. For example, preferably, a maize plant stably transformed with a 
vector comprising a polynucleotide sequence that encodes a-amylase that is greater than 60% 



Figures 1 A and IB illustrate the activity of a-amylase expressed in corn kernels and in 
the endosperm from segregating Tl kernels from pNOV6201 plants and from six pNOV6200 

lines. 

Figure 2 illustrates the activity of a-amylase in segregating Tl kernels from pNOV6201 

lines. 

Figure 3 depicts the amount of ethanol produced upon fermentation of mashes of 
transgenic corn containing thermostable 797GL3 alpha amylase that were subjected to 
liquefaction times of up to 60 minutes at 85°C and 95°C. This figure illustrates that the ethanol 
yield at 72 hours of fermentation was almost unchanged from 15 minutes to 60 minutes of 
liquefaction. Moreover, it shows that mash produced by liquefaction at 95°C produced more 
ethanol at each time point than mash produced by liquefaction at 85°C. 

Figure 4 depicts the amount of residual starch (%) remaining after fermentation of 
mashes of transgenic corn containing thermostable alpha amylase that were subjected to a 
liquefaction time of up to 60 minutes at 85°C and 95°C. This figure illustrates that the ethanol 
yield at 72 hours of fermentation was almost unchanged from 15 minutes to 60 minutes of 
liquefaction. Moreover, it shows that mash produced by liquefaction at 95°C produced more 
ethanol at each time point than mash produced by liquefaction at 85°C. 

Figure 5 depicts the ethanol yields for mashes of a transgenic corn, control corn, and 
various mixtures thereof prepared at 85°C and 95°C. This figure illustrates that the transgenic 
com comprising a-amylase results in significant improvement in making starch available for 
fermentation since there was a reduction of starch left over after fermentation. 



identical to SEQ ID NO: 1 or SEQ ID NO: 51 is encompassed. 
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Figure 6 depicts the amount of residual starch measured in dried stillage following 
fermentation for mashes of a transgenic grain, control corn, and various mixtures thereof at 
prepared at 85°C and 95°C. 

Figure 7 depicts the ethanol yields as a function of fermentation time of a sample 
comprising 3% transgenic corn over a period of 20-80 hours at various pH ranges from 5.2-6.4. 
The figure illustrates that the fermentation conducted at a lower pH proceeds faster than at a pH 
of 6.0 or higher. 

Figure 8 depicts the ethanol yields during fermentation of a mash comprising various 
weight percentages of transgenic corn from 0-12 wt% at various pH ranges from 5.2-6.4. This 
figure illustrates that the ethanol yield was independent of the amount of transgenic grain 
included in the sample. 

Figure 9 shows the analysis of T2 seeds from different events transformed with pNOV 
7005. High expression of pullulanase activity, compared to the non-transgenic control, can be 
detected in a number of events. 

Figure 10A and 1 0B show the results of the HPLC analysis of the hydrolytic products 
generated by expressed pullulanase from starch in the transgenic corn flour. Incubation of the 
flour of pullulanase expressing corn in reaction buffer at 75 °C for 30 minutes results in 
production of medium chain oligosaccharides (degree of polymerization (DP) -10-30) and short 
amy lose chains (DP - 100 -200) from cornstarch. Figures 10A and 10B also show the effect of 
added calcium ions on the activity of the pullulanase. 

Figures 1 1 A and 1 IB depict the data generated from HPLC analysis of the starch 
hydrolysis product from two reaction mixtures. The first reaction indicated as 'Amylase* 
contains a mixture [1:1 (w/w)] of corn flour samples of a -amylase expressing transgenic corn 
and non-transgenic corn A 188; and the second reaction mixture 'Amylase + Pullulanase 1 
contains a mixture [1:1 (w/w)] of corn flour samples of a -amylase expressing transgenic com 
and pullulanase expressing transgenic corn. 

Figure 12 depicts the amount of sugar product in jig in 25 /xl of reaction mixture for two 
reaction mixtures. The first reaction indicated as 'Amylase' contains a mixture [1:1 (w/w)] of 
com flour samples of a -amylase expressing transgenic corn and non-transgenic com A 188; and 
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the second reaction mixture 'Amylase + Pullulanase* contains a mixture [1:1 (w/w)] of corn flour 
samples of a -amylase expressing transgenic corn and pullulanase expressing transgenic com. 

Figure 13A and 13B shows the starch hydrolysis product from two sets of reaction 
mixtures at the end of 30 minutes incubation at 85°C and 95°C. For each set there are two 
reaction mixtures; the first reaction indicated as 'Amylase X Pullulanase' contains flour from 
transgenic corn (generated by cross pollination) expressing both the a-amylase and the 
pullulanase, and the second reaction indicated as 'Amylase' mixture of corn flour samples of a - 
amylase expressing transgenic corn and non-transgenic corn A188 in a ratio so as to obtain same 
amount of a -amylase activity as is observed in the cross (Amylase X Pullulanase). 

Figure 14 depicts the degradation of starch to glucose using non-transgenic corn seed 
(control), transgenic com seed comprising the 797GL3 a-amylase, and a combination of 797GL3 
transgenic com seed with Mai A a-glucosidase. 

Figure 15 depicts the conversion of raw starch at room temperature or 30°C. In this 
figure, the reaction mixtures 1 and 2 are a combination of water and starch at room temperature 
and 30°C, respectively. Reaction mixtures 3 and 4 are a combination of barley a-amylase and 
starch at room temperature and at 30°C, respectively. Reaction mixtures 5 and 6 are 
combinations of Thermoanaerobacterium glucoamylase and starch at room temperature and 
30°C, respectively, Reactions mixtures 7 and 8 are combinations of barley a-amylase (sigma) 
and Thermoanaerobacterium glucoamylase and starch at room temperature and 30°C, 
respectively. Reaction mixtures 9 and 10 are combinations of Barley alpha-amylase (sigma) 
control, and starch at room temperature and 30°C, respectively. The degree of polymerization 
(DP) of the products of the Thermoanaerobacterium glucoamylase is indicated. 

Figure 16 depicts the production of fructose from amylase transgenic com flour using a 
combination of alpha amylase, alpha glucosidase, and glucose isomerase as described in 
Example 19. Amylase com flour was mixed with enzyme solutions plus water or buffer. All 
reactions contained 60 mg amylase flour and a total of 600jal of liquid and were incubated for 2 
hours at 90°C 

Figure 17 depicts the peak areas of the products of reaction with 100% amylase flour 
from a self-processing kernel as a function of incubation time from 0-1200 minutes at 90°C 

22 



WO 2005/096804 



PCT/US2004/007182 



Figure 18 depicts the peak areas of the products of reaction with 10% transgenic amylase 
flour from a self-processing kernel and 90% control corn flour as a function of incubation time 
from 0-1 200 minutes at 90°C. 

Figure 19 provides the results of the HPLC analysis of transgenic amylase flour 
incubated at 70°, 80°, 90*, or 100° C for up to 90 minutes to assess the effect of temperature on 
starch hydrolysis. 

Figure 20 depicts ELSD peak area for samples containing 60 mg transgenic amylase flour 
mixed with enzyme solutions plus water or buffer under various reaction conditions. One set of 
reactions was buffered with 50 mM MOPS, pH 7.0 at room temperature, plus lOmM MgS04 and 
1 mM CoCl 2 ; in a second set of reactions the metal-containing buffer solution was replaced by 
water. All reactions were incubated for 2 hours at 90°C. 

Detailed Description of the Invention 

In accordance with the present invention, a "self-processing" plant or plant part has 
incorporated therein an isolated polynucleotide encoding a processing enzyme capable of 
processing, e.g., modifying, starches, polysaccharides, lipids, proteins, and the like in plants, 
wherein the processing enzyme can be mesophilic, thermophilic or hyperthermophilic, and may 
be activated by grinding, addition of water, heating, or otherwise providing favorable conditions 
for function of the enzyme. The isolated polynucleotide encoding the processing enzyme is 
integrated into a plant or plant part for expression therein. Upon expression and activation of the 
processing enzyme, the plant or plant part of the present invention processes the substrate upon 
which the processing enzyme acts. Therefore, the plant or plant parts of the present invention are 
capable of self-processing the substrate of the enzyme upon activation of the processing enzyme 
contained therein in the absence of or with reduced external sources normally required for 
processing these substrates. As such, the transformed plants, transformed plant cells, and 
transformed plant parts have "built-in" processing capabilities to process desired substrates via 
the enzymes incorporated therein according to this invention. Preferably, the processing 
enzyme-encoding polynucleotide are "genetically stable," i.e., the polynucleotide is stably 

* 

maintained in the transformed plant or plant parts of the present invention and stably inherited by 
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progeny through successive generations. 

In accordance with the present invention, methods which employ such plants and plant 
parts can eliminate the need to mill or otherwise physically disrupt the integrity of plant parts 
prior to recovery of starch-derived products. For example, the invention provides improved 
methods for processing corn and other grain to recover starch-derived products. The invention 
also provides a method which allows for the recovery of starch granules that contain levels of 
starch degrading enzymes, in or on the granules, that are adequate for the hydrolysis of specific 
bonds within the starch without the requirement for adding exogenously produced starch 
hydrolyzing enzymes. The invention also provides improved products from the self-processing 
plant or plant parts obtained by the methods of the invention. 

In addition, the "self-processing" transformed plant part, e.g., grain, and transformed 
plant avoid major problems with existing technology, i.e., processing enzymes are typically 
produced by fermentation of microbes, which requires isolating the enzymes from the culture 
supernatants, which costs money; the isolated enzyme needs to be formulated for the particular 
application, and processes and machinery for adding, mixing and reacting the enzyme with its 
substrate must be developed. The transformed plant of the invention or a part thereof is also a 
source of the processing enzyme itself as well as substrates and products of that enzyme, such as 
sugars, amino acids, fatty acids and starch and non-starch polysaccharides. The plant of the 
invention may also be employed to prepare progeny plants such as hybrids and inbreds. 

Processing Enzymes And Polynucleotides Encoding Them 

A polynucleotide encoding a processing enzyme (mesophilic, thermophilic, or 
hyperthermophilic) is introduced into a plant or plant part. The processing enzyme is selected 
based on the desired substrate upon which it acts as found in plants or transgenic plants and/or 
the desired end product. For example, the processing enzyme may be a starch-processing 
enzyme, such as a starch-degrading or starch-isomerizing enzyme, or a non-starch processing 
enzyme. Suitable processing enzymes include, but are not limited to, starch degrading or 
isomerizing enzymes including, for example, ot-amylase, endo or exo-1,4, or 1 ,6-a-D, 
glucoamylase, glucose isomerase, p-amylases, ot-glucosidases, and other exo-amylases; and 
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starch debranching enzymes, such as isoamylase, pullulanase, neo-pullulanase, iso-pullulanase, 
amylopullulanase and the like, glycosyl transferases such as cyclodextrin glycosyltransferase and 
the like, cellulases such as exo-l,4-p-cellobiohydrolase, exo-l,3-p-D-glucanase, hemicellulase, 
P-glucosidase and the like; endoglucanases such as endo-l,3-P-glucanase and endo-l,4-P- 
glucanase and the like; L-arabinases, such as endo-l,5-a-L-arabinase, a-arabinosidases and the 
like; galactanases such as endo-l,4-P-D-galactanase, endo-l,3-p-D-galactanase, p-galactosidase, 
a-galactosidase and the like; mannanases, such as endo-l,4-P-D-mannanase, P-mannosidase, <x- 
mannosidase and the like; xylanases, such as endo-l,4-p-xylanase, P-D-xylosidase, 1,3-p-D- 
xylanase, and the like; and pectinases; and non-starch processing enzymes, including protease, 
glucanase, xylanase, thioredoxin/thioredoxin reductase, esterase, phytase, and lipase. 

In one embodiment, the processing enzyme is a starch-degrading enzyme selected from 
the group of a-amylase, pullulanase, a-glucosidase, glucoamylase, amylopullulanase, glucose 
isomerase, or combinations thereof. According to this embodiment, the starch-degrading 
enzyme is able to allow the self-processing plant or plant part to degrade starch upon activation 
of the enzyme contained in the plant or plant part, as will be further described herein. The 
starch-degrading enzyme(s) is selected based on the desired end-products. For example, a 
glucose-isomerase may be selected to convert the glucose (hexose) into fructose. Alternatively, 
the enzyme may be selected based on the desired starch-derived end product with various chain 
lengths based on, e.g., a function of the extent of processing or with various branching patterns 
desired. For example, an a -amylase, glucoamylase, or amylopullulanase can be used under 
short incubation times to produce dextrin products and under longer incubation times to produce 
shorter chain products or sugars. A pullulanase can be used to. specifically hydrolyze branch 
points in the starch yielding a high-amylose starch, or a neopullulanase can be used to produce 
starch with stretches of a 1,4 linkages with interspersed a 1,6 linkages. Glucosidases could be 
used to produce limit dextrins, or a combination of different enzymes to make other starch 
derivatives. 

In another embodiment, the processing enzyme is a non-starch processing enzyme 
selected from protease, glucanase, xylanase, phytase, lipase, cellulase, beta glucosidase and 
esterase. These non-starch degrading enzymes allow the self-processing plant or plant part of the 

25 



WO 2(105/096804 



PCTAJS2004/007182 



present invention to incorporate in a targeted area of the plant and, upon activation, disrupt the 
plant while leaving the starch granule therein intact. For example, in a preferred embodiment, 
the non-starch degrading enzymes target the endosperm matrix of the plant cell and, upon 
activation, disrupt the endosperm matrix while leaving the starch granule therein intact and more 
readily recoverable from the resulting material. 

Combinations of processing enzymes are further envisioned by the present invention. 
For example, starch-processing and non-starch processing enzymes may be used in combination. 
Combinations of processing enzymes may be obtained by employing the use of multiple gene 
constructs encoding each of the enzymes. Alternatively, the individual transgenic plants stably 
transformed with the enzymes may be crossed by known methods to obtain a plant containing 
both enzymes. Another method includes the use of exogenous enzyme(s) with the transgenic 
plant. 

The processing enzymes may be isolated or derived from any source and the 
polynucleotides corresponding thereto may be ascertained by one having skill in the art. For 
example, the processing enzyme, such as a-amylase, is derived from the Pyrococcus (e.g., 
Pyrococcus furiosus), Thermus, Thermococcus (e.g., Thermococcus hydrothermalis), Sulfolobus 
(e.g., Sulfolobus solfataricus) Thermotoga (e.g., Thermotoga maritima and Thermotoga 
neapolitana), Thermoanaerobacterium (e.g. Thermoanaerobacter tengcongensis), Aspergillus 
(e.g., Aspergillus shirousami and Aspergillus niger), Rhizopus (eg., Rhizopus oryzae), 
Thermoproteales, Desulfiirococcus (e.g. Desulfurococcus amylolyticus), Methanobacterium 
ihermoautotrophicum, Methanococcus jannaschii, Methanopyrus kandleri, 
Thermosynechococcus elongatus, Tlxermoplasma acidophilum, Tfiermoplasma volcanium, 
Aeropyrum pernix and plants such as com, barley, and rice. 

The processing enzymes of the present invention are capable of being activated after 
being introduced and expressed in the genome of a plant. Conditions for activating the enzyme 
are determined for each individual enzyme and may include varying conditions such as 
temperature, pH, hydration, presence of metals, activating compounds, inactivating compounds, 
etc. For example, temperature-dependent enzymes may include mesophilic, thermophilic, and 
hyperthermophilic enzymes. Mesophilic enzymes typically have maximal activity at 
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temperatures between 20°- 65°C and are inactivated at temperatures greater than 70° C. 
Mesophilic enzymes have significant activity at 30 to 37°C, the activity at 30 °C is preferably at 
least 10% of maximal activity, more preferably at least 20% of maximal activity. 

Thermophilic enzymes have a maximal activity at temperatures of between 50 and 80° C 
and are inactivated at temperatures greater than 80°C . A thermophilic enzyme will preferably 
have less than 20% of maximal activity at 30°C, more preferably less than 10% of maximal 
activity. 

A "hypertherrnophilic" enzyme has activity at even higher temperatures. 
Hypertherrnophilic enzymes have a maximal activity at temperatures greater than 80° C and 
retain activity at temperatures at least 80°C, more preferably retain activity at temperatures of at 
least 90°C and most preferably retain activity at temperatures of at least 95°C. 
Hypertherrnophilic enzymes also have reduced activity at low temperatures. A 
hypertherrnophilic enzyme may have activity at 30°C that is less than 10% of maximal activity, 
and preferably less than 5% of maximal activity. 

The polynucleotide encoding the processing enzyme is preferably modified to include 
codons that are optimized for expression in a selected organism such as a plant (see, e.g., Wada 
et al., Nucl. Acids Res. . 18:2367 (1990), Murray et al., Nucl. Acids Res. , 17:477 (1989), U.S. 
Patent Nos. 5,096,825, 5,625,136, 5,670,356 and 5,874,304). Codon optimized sequences are 
synthetic sequences, i.e., they do not occur in nature, and preferably encode the identical 
polypeptide (or an enzymatically active fragment of a full length polypeptide which has 
substantially the same activity as the frill length polypeptide) encoded by the non-codon 
optimized parent polynucleotide which encodes a processing enzyme. It is preferred that the 
polypeptide is biochemically distinct or improved, e.g., via recursive mutagenesis of DNA 
encoding a particular processing enzyme, from the parent source polypeptide such that its 
performance in the process application is improved. Preferred polynucleotides are optimized for 
expression in a target host plant and encode a processing enzyme. Methods to prepare these 
enzymes include mutagenesis, e.g., recursive mutagenesis and selection. Methods for mutagenesis 
and nucleotide sequence alterations are well-known in the art. See, for example, Kunkel, Proc. 
Natl. Acad. Sci. USA . 82:488, (1985); Kunkel et al., Methods in EnzvmoL 154:367 (1987); US 
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Patent No. 4,873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology 
(MacMillan Publishing Company, New York) and the references cited therein and Arnold et al., 
Chem. Eng. Sci. , 51:5091 (1996)). Methods to optimize the expression of a nucleic acid segment 
in a target plant or organism are well-known in the art. Briefly, a codon usage table indicating the 
optimal codons used by the target organism is obtained and optimal codons are selected to replace 
those in the target polynucleotide and the optimized sequence is then chemically synthesized. 
Preferred codons for maize are described in U.S. Patent No. 5,625,136. 

Complementary nucleic acids of the polynucleotides of the present invention are further 
envisioned. An example of low stringency conditions for hybridization of complementary 
nucleic acids which have more than 100 complementary residues on a filter in a Southern or 
Northern blot is 50% formamide, e.g., hybridization in 50% formamide, 1 M NaCl, 1% SDS at 
37°C, and a wash in 0.1 X SSC at 60°C to 65°C. Exemplary low stringency conditions include 
hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium 
dodecyl sulphate) at 37°C, and a wash in IX to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M 
trisodium citrate) at 50 to 55°C Exemplary moderate stringency conditions include 
hybridization in 40 to 45% formamide, 1.0 M NaCl, 1% SDS at 37°C, and a wash in 0.5X to IX 
SSC at 55 to 60°C. 

Moreover, polynucleotides encoding an "enzymatically active" fragment of the 
processing enzymes are further envisioned. As used herein, "enzymatically active" means a 
polypeptide fragment of the processing enzyme that has substantially the same biological activity 
as the processing enzyme to modify the substrate upon which the processing enzyme normally 
acts under appropriate conditions. 

In a preferred embodiment, the polynucleotide of the present invention is a maize- 
optimized polynucleotide encoding a-amylase, such as provided in SEQ ID NOs:2, 9, 46, and 52. 
In another preferred embodiment, the polynucleotide is a maize-optimized polynucleotide 
encoding pullulanase, such as provided in SEQ ID NOs: 4 and 25. In yet another preferred 
embodiment, the polynucleotide is a maize-optimized polynucleotide encoding a-glucosidase as 
provided in SEQ ID NO:6. Another preferred polynucleotide is the maize-optimized 
polynucleotide encoding glucose isomerase having SEQ ID NO: 19, 21, 37, 39, 41, or 43. In 
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another embodiment, the maize-optimized polynucleotide encoding glucoamylase as set forth in 
SEQ ID NO: 46, 48, or 50 is preferred. Moreover, a maize-optimized polynucleotide for 
glucanase/mannanase fusion polypeptide is provided in SEQ ID NO: 57. The invention further 
provides for complements of such polynucleotides, which hybridize under moderate, or 
preferably under low stringency, hybridization conditions and which encodes a polypeptide 
having a-amylase, pullulanase, a-glucosidase, glucose isomerase, glucoamylase, glucanase, or 
mannanase activity, as the case may be. 

The polynucleotide may be used interchangeably with "nucleic acid" or "polynucleic 
acid" and refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- 
or double-stranded form, composed of monomers (nucleotides) containing a sugar, phosphate 
and a base, which is either a purine or pyrimidine. Unless specifically limited, the term 
encompasses nucleic acids containing known analogs of natural nucleotides, which have similar 
binding properties as the reference nucleic acid and are metabolized in a manner similar to 
naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence 
also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon 
substitutions) and complementary sequences as well as the sequence explicitly indicated. 
Specifically, degenerate codon substitutions may be achieved by generating sequences in which 
the third position of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues. 

"Variants" or substantially similar sequences are further encompassed herein. For 
nucleotide sequences, variants include those sequences that, because of the degeneracy of the 
genetic code, encode the identical amino acid sequence of the native protein. Naturally 
occurring allelic variants such as these can be identified with the use of well-known molecular 
biology techniques, as, for example, with polymerase chain reaction (PCR), hybridization 
techniques, and ligation reassembly techniques. Variant nucleotide sequences also include 
synthetically derived nucleotide sequences, such as those generated, for example, by using site- 
directed mutagenesis, which encode the native protein, as well as those that encode a polypeptide 
having amino acid substitutions. Generally, nucleotide sequence variants of the invention will 
have at least 40%, 50%, 60%, preferably 70%, more preferably 80%, even more preferably 90%, 
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most preferably 99%, and single unit percentage identity to the native nucleotide sequence based 
on these classes. For example, 71%, 72%, 73% and the like, up to at least the 90% class. Variants 
may also include a full-length gene corresponding to an identified gene fragment. 

Regulator)' Sequences: Promoters/Signal Sequences/Selectable Markers 

The polynucleotide sequences encoding the processing enzyme of the present invention 
may be operably linked to polynucleotide sequences encoding localization signals or signal 
sequence (at the N- or C-terminus of a polypeptide), e.g., to target the hyperthermophilic enzyme 
to a particular compartment within a plant. Examples of such targets include, but are not limited 
to, the vacuole, endoplasmic reticulum, chloroplast, amyloplast, starch granule, or cell wall, or to 
a particular tissue, e.g., seed. The expression of a polynucleotide encoding a processing enzyme 
having a signal sequence in a plant, in particular, in conjunction with the use of a tissue-specific 
or inducible promoter, can yield high levels of localized processing enzyme in the plant. 
Numerous signal sequences are known to influence the expression or targeting of a 
polynucleotide to a particular compartment or outside a particular compartment. Suitable signal 
sequences and targeting promoters are known in the art and include, but are not limited to, those 
provided herein. 

For example, where expression in specific tissues or organs is desired, tissue-specific 
promoters may be used. In contrast, where gene expression in response to a stimulus is desired, 
inducible promoters are the regulatory elements of choice. Where continuous expression is 
desired throughout the cells of a plant, constitutive promoters are utilized. Additional regulatory 
sequences upstream and/or downstream from the core promoter sequence may be included in 
expression constructs of transformation vectors to bring about varying levels of expression of 
heterologous nucleotide sequences in a transgenic plant. 

A number of plant promoters have been described with various expression characteristics. 
Examples of some constitutive promoters which have been described include the rice actin 1 
(Wang et al., Mol. Cell. Biol. , 12:3399 (1992); U.S. Patent No. 5,641,876), CaMV 35S (Odell et 
al., Nature , 313:810 (1985)), CaMV 19S (Lawton et a!., 1987), nos (Ebert et al., 1987), Adh 
(Walker et al., 1987), sucrose synthase (Yang & Russell, 1990), and the ubiquitin promoters. 
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Vectors for use in tissue-specific targeting of genes in transgenic plants will typically 
include tissue-specific promoters and may also include other tissue-specific control elements 
such as enhancer sequences. Promoters which direct specific or enhanced expression in certain 
plant tissues will be known to those of skill in the art in light of the present disclosure. These 
include, for example, the rbcS promoter, specific for green tissue; the ocs, nos and mas 
promoters which have higher activity in roots or wounded leaf tissue; a truncated (-90 to +8) 35S 
promoter which directs enhanced expression in roots, an ct-tubulin gene that directs expression in 
roots and promoters derived from zein storage protein genes which direct expression in 
endosperm. 

Tissue specific expression may be functionally accomplished by introducing a 
constitutively expressed gene (all tissues) in combination with an antisense gene that is 
expressed only in those tissues where the gene product is not desired. For example, a gene 
coding for a lipase may be introduced such that it is expressed in all tissues using the 35S 
promoter from Cauliflower Mosaic Virus. Expression of an antisense transcript of the lipase 
gene in a maize kernel, using for example a zein promoter, would prevent accumulation of the 
lipase protein in seed. Hence the protein encoded by the introduced gene would be present in all 
tissues except the kernel. 

Moreover, several tissue-specific regulated genes and/or promoters have been reported in 
plants. Some reported tissue-specific genes include the genes encoding the seed storage proteins 
(such as napin, cruciferin, beta-conglycinin, and phaseolin) zein or oil body proteins (such as 
oleosin), or genes involved in fatty acid biosynthesis (including acyl carrier protein, stearoyl- 
ACP desaturase, and fatty acid desaturases (fad 2-1)), and other genes expressed during embryo 
development (such as Bce4, see, for example, EP 255378 and Kridl et al., Seed Science 
Research , 1:209 (1991)). Examples of tissue-specific promoters, which have been described 
include the lectin (Vodkin, Prog. Clin. Biol. Res. , 138:87 (1983); Lindstrom et al., Per. Genet. , 
JJ.:160 (1990)), corn alcohol dehydrogenase 1 (Vogel et al., 1989; Dennis et al., Nucleic Acids 
Res. , 12:3983 (1984)), corn light harvesting complex (Simpson, 1986; Bansal et ah, Proc. Natl. 
Acad. Sci. USA . 89:3654 (1992)), corn heat shock protein (Odell et al., 1985; Rochester et al., 
1986), pea small subunit RuBP carboxylase (Poulsen et al., 1986; Cashmore et al,, 1983), Ti 
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plasmid mannopine synthase (Langridge et al., 1989), Ti plasmid nopaline synthase (Langridge 
et al., 1989), petunia chalcone isomerase (vanTunen et al., EMBO J. , 7;1257(1988)), bean 
glycine rich protein 1 (Keller et al., Genes Dev. , 3:1639 (1989)), truncated CaMV 35s (Odell et 
al., Nature, 313:810 (1985)), potato patatin (Wenzler et al., Plant Mol. BioL 13:347 (1989)), root 
cell (Yamamoto et al., Nucleic Acids Res. , 18:7449 (1990)), maize zein (Reina et al., Nucleic 
Acids Res. , 18:6425 (1990); Kriz et al., Mol. Gen. Genet. . 207:90 (1987); Wandelt et al., Nucleic 
Acids Res. , 17:2354 (1989); Langridge et al., Ceil, 34:1015 (1983); Reina et al., Nucleic Acids 
Res., ]8:7449 (1990)), globulin- 1 (Belanger et a!., Genetics , J29:863 (1991)), a-tubulin, cab 
(Sullivan et al., Mol. Gen. Genet. , 215:431 (1989)), PEPCase (Hudspeth & Grula, 1989), R gene 
complex-associated promoters (Chandler et al., Plant Cell , 1:1 175 (1989)), and chalcone 
synthase promoters (Franken et al., EMBO J. , 10:2605 (1991)). Particularly useful for seed- 
specific expression is the pea vicilin promoter (Czako et al., Mol. Gen. Genet. , 235:33 (1992). 
(See also U.S. Pat. No. 5,625,136, herein incorporated by reference.) Other useful promoters for 
expression in mature leaves are those that are switched on at the onset of senescence, such as the 
SAG promoter from Arabidopsis (Gan et al., Science , 270:1986 (1995). 

A class of fruit-specific promoters expressed at or during anthesis through fruit 
development, at least until the beginning of ripening, is discussed in U.S. 4,943,674, the 
disclosure of which is hereby incorporated by reference. cDNA clones that are preferentially 
expressed in cotton fiber have been isolated (John et al., Proc. Natl. Acad. Sci. USA , 89:5769 
(1992). cDNA clones from tomato displaying differential expression during fruit development 
have been isolated and characterized (Mansson et al., Gen. Genet., 200:356 (1985), Slater et al., 
Plant Mol. Biol. , 5:1 37 (1985)). The promoter for polygalacturonase gene is active in fruit 
ripening. The polygalacturonase gene is described in U.S. Patent No. 4,535,060, U.S. Patent No. 
4,769,061, U.S. Patent No. 4,801,590, and U.S. Patent No. 5,107,065, which disclosures are 
incorporated herein by reference. 

Other examples of tissue-specific promoters include those that direct expression in leaf 
cells following damage to the leaf (for example, from chewing insects), in tubers (for example, 
patatin gene promoter), and in fiber cells (an example of a developmentally-regulated fiber cell 
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protein is E6 (John et al., Proc. Natl. Acad. Sci. USA , 89:5769 (1992). The E6 gene is most 
active in fiber, although low levels of transcripts are found in leaf, ovule and flower. 

The tissue-specificity of some "tissue-specific" promoters may not be absolute and may 
be tested by one skilled in the art using the diphtheria toxin sequence. One can also achieve 
tissue-specific expression with "leaky" expression by a combination of different tissue-specific 
promoters (Beals et al., Plant Cell 9:1 527 (1997)). Other tissue-specific promoters can be 
isolated by one skilled in the art (see U.S. 5,589,379). 

In one embodiment, the direction of the product from a polysaccharide hydrolysis gene, 
such as a-amylase, may be targeted to a particular organelle such as the apoplast rather than to 
the cytoplasm. This is exemplified by the use of the maize y-zein N-terminal signal sequence 
(SEQ ID NO: 1 7), which confers apoplast-specific targeting of proteins. Directing the protein or 
enzyme to a specific compartment will allow the enzyme to be localized in a manner that it will 
not come into contact with the substrate. In this manner the enzymatic action of the enzyme will 
not occur until the enzyme contacts its substrate. The enzyme can be contacted with its substrate 
by the process of milling (physical disruption of the cell integrity), or heating the cells or plant 
tissues to disrupt the physical integrity of the plant cells or organs that contain the enzyme. For 
example a mesophilic starch-hydrolyzing enzyme can be targeted to the apoplast or to the 
endoplasmic reticulum and so as not to come into contact with starch granules in the amyloplast. 
Milling of the grain will disrupt the integrity of the grain and the starch hydrolyzing enzyme will 
then contact the starch granules. In this manner the potential negative effects of co-localization 
of an enzyme and its substrate can be circumvented. 

In another embodiment, a tissue-specific promoter includes the endosperm-specific 
promoters such as the maize 7-zein promoter (exemplified by SEQ ED NO: 12) or the maize 
ADP-gpp promoter (exemplified by SEQ ID NO: 1 1, which includes a 5' untranslated and an 
intron sequence) or a Q protein promoter (exemplified by SEQ ID NO: 98) or a rice glutelin 1 
promoter (exemplified in SEQ ID NO:67). Thus, the present invention includes an isolated 
polynucleotide comprising a promoter comprising SEQ ID NO: 1 1, 12, 67, or 98, a 
polynucleotide which hybridizes to the complement thereof under low stringency hybridization 



33 



WO 2005/096804 



PCT/US2004/007182 



conditions, or a fragment thereof which has promoter activity, e.g., at least 10%, and preferably 
at least 50%, the activity of a promoter having SEQ ID NO: 1 1 , 1 2, 67, or 98. 

In another embodiment of the invention, the polynucleotide encodes a hyperthermophilic 
processing enzyme that is operably linked to a chloroplast (amyloplast) transit peptide (CTP) and 
a starch binding domain, e.g., from the waxy gene. An exemplary polynucleotide in this 
embodiment encodes SEQ ID NO; 10 (a-amylase linked to the starch binding domain from 
waxy). Other exemplary polynucleotides encode a hyperthermophilic processing enzyme linked 
to a signal sequence that targets the enzyme to the endoplasmic reticulum and secretion to the 
apoplast (exemplified by a polynucleotide encoding SEQ ID NO: 13, 27, or 30, which comprises 
the N-terminal sequence from maize y-zein operably linked to a-amylase, ct-glucosidase, glucose 
isomerase, respectively), a hyperthermophilic processing enzyme linked to a signal sequence 
which retains the enzyme in the endoplasmic reticulum (exemplified by a polynucleotide 
encoding SEQ ID NO: 14, 26, 28, 29, 33, 34, 35, or 36, which comprises the N-terminal sequence 
from maize y-zein operably linked to the hyperthermophilic enzyme, which is operably linked to 
SEKDEL, wherein the enzyme is a-amylase, malA a-glucosidase, T. maritima glucose 
isomerase, T. neapolitana glucose isomerase), a hyperthermophilic processing enzyme linked to 
an N-terminal sequence that targets the enzyme to the amyloplast (exemplified by a 
polynucleotide encoding SEQ ID NO: 15, which comprises the N-terminal amyloplast targeting 
sequence from waxy operably linked to a-amylase), a hyperthermophilic fusion polypeptide 
which targets the enzyme to starch granules (exemplified by a polynucleotide encoding SEQ ID 
NO: 16, which comprises the N-terminal amyloplast targeting sequence from waxy operably 
linked to an a-amylase/waxy fusion polypeptide comprising the waxy starch binding domain), a 
hyperthermophilic processing enzyme linked to an ER retention signal (exemplified by a 
polynucleotide encoding SEQ ED NO:38 and 39). Moreover, a hyperthermophilic processing 
enzyme may be linked to a raw-starch binding site having the amino acid sequence (SEQ ID 
NO:53), wherein the polynucleotide encoding the processing enzyme is linked to the maize- 
optimized nuleic acid sequence (SEQ ED NO:54) encoding this binding site. 

Several inducible promoters have been reported. Many are described in a review by 
Gatz, in Current Opinion in Biotechnology , 7:168 (1996) and Gatz, C, Annu. Rev. Plant Physiol. 
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Plant Mol. Biol. , 48:89 (1997). Examples include tetracycline repressor system, Lac repressor 
system, copper-inducible systems, salicylate-inducible systems (such as the PR la system), 
glucocorticoid-inducible (Aoyama T. et al., N-H Plant Journal , 11:605 (1997)) and ecdysone- 
inducible systems. Other inducible promoters include ABA- and turgor-inducible promoters, the 
promoter of the auxin-binding protein gene (Schwob et al., Plant J„ 4:423 (1993)), the UDP 
glucose flavonoid glycosyl-transferase gene promoter (Ralston et al., Genetics , 119:185 (1988)), 
the MPI proteinase inhibitor promoter (Cordero et al., Plant J. , 6:141 (1994)), and the 
glyceraldehyde-3-phosphate dehydrogenase gene promoter (Kohler et al., Plant Mol. Biol. , 
29;1293 (1995); Quigley et al., J. Mol. Evol. , 29:412 (1989); Martinez et al., J. Mol. Biol. , 
208:551 (1989)). Also included are the benzene sulphonamide-inducible (U.S. 5364,780) and 
alcohol-inducible (WO 97/06269 and WO 97/06268) systems and glutathione S-transferase 
promoters. 

Other studies have focused on genes inducibly regulated in response to environmental 
stress or stimuli such as increased salinity, drought, pathogen and wounding. (Graham et al., J. 
Biol. Chem. , 260:6555 (1985); Graham et al., J. Biol. Chem. . 260:6561 (1985), Smith et al, 
Planta, 168:94 (1986)). Accumulation of metallocarboxypeptidase-inhibitor protein has been 
reported in leaves of wounded potato plants (Graham et al., Biochem. Biophys. Res. Comm. , 
101: 1 164 (1981)). Other plant genes have been reported to be induced by methyl jasmonate, 
elicitors, heat-shock, anaerobic stress, or herbicide safeners. 

Regulated expression of a chimeric transacting viral replication protein can be further 

regulated by other genetic strategies, such as, for example, Cre-mediated gene activation (Odell 

et al. Mol. Gen. Genet. , 113:369 (1990)). Thus, a DNA fragment containing 3' regulatory 

sequence bound by lox sites between the promoter and the replication protein coding sequence 

that blocks the expression of a chimeric replication gene from the promoter can be removed by 

Cre-mediated excision and result in the expression of the trans-acting replication gene. In this 

case, the chimeric Cre gene, the chimeric trans-acting replication gene, or both can be under the 

control of tissue- and developmental-specific or inducible promoters. An alternate genetic 

strategy is the use of tRNA suppressor gene. For example, the regulated expression of a tRNA 

suppressor gene can conditionally control expression of a trans-acting replication protein coding 

sequence containing an appropriate termination codon (Ulmasov et al. Plant Mol. Biol. , 35:417 
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(1 997)). Again, either the chimeric tRNA suppressor gene, the chimeric transacting replication 
gene, or both can be under the control of tissue- and developmental-specific or inducible 
promoters. 

Preferably, in the case of a multicellular organism, the promoter can also be specific to a 
particular tissue, organ or stage of development. Examples of such promoters include, but are 
not limited to, the Zea mays ADP-gpp and the Zea mays y-zein promoter and the Zea mays 
globulin promoter . 

Expression of a gene in a transgenic plant may be desired only in a certain time period 
during the development of the plant. Developmental timing is frequently correlated with tissue 
specific gene expression. For example, expression of zein storage proteins is initiated in the 
endosperm about 15 days after pollination. 

Additionally, vectors may be constructed and employed in the intracellular targeting of a 
specific gene product within the cells of a transgenic plant or in directing a protein to the 
extracellular environment. This will generally be achieved by joining a DNA sequence encoding 
a transit or signal peptide sequence to the coding sequence of a particular gene. The resultant 
transit, or signal, peptide will transport the protein to a particular intracellular, or extracellular 
destination, respectively, and will then be post-translationally removed. Transit or signal 
peptides act by facilitating the transport of proteins through intracellular membranes, e.g., 
vacuole, vesicle, plastid and mitochondrial membranes, whereas signal peptides direct proteins 
through the extracellular membrane. 

A signal sequence such as the maize y-zein N-terminal signal sequence for targeting to 
the endoplasmic reticulum and secretion into the apoplast may be operably linked to a 
polynucleotide encoding a hyperthermophilic processing enzyme in accordance with the present 
invention (Torrent et al., 1997). For example, SEQ ID NOs:13, 27, and 30 provides for a 
polynucleotide encoding a hyperthermophilic enzyme operably linked to the N-terminal 
sequence from maize y-zein protein. Another signal sequence is the amino acid sequence 
SEKDEL for retaining polypeptides in the endoplasmic reticulum (Munro and Pelham, 1987). 
For example, a polynucleotide encoding SEQ ED NOS:14, 26, 28, 29, 33, 34, 35, or 36, which 
comprises the N-terminal sequence from maize y-zein operably linked to a processing enzyme 
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which is operably linked to SEKDEL. A polypeptide may also be targeted to the amyloplast by 
fusion to the waxy amyloplast targeting peptide (Klosgen et al., 1986) or to a starch granule. For 
example, the polynucleotide encoding a hyperthermophilic processing enzyme may be operably 
linked to a chloroplast (amyloplast) transit peptide (CTP) and a starch binding domain, e.g., from 
the waxy gene. SEQ ED NO: 10 exemplifies a-amylase linked to the starch binding domain from 
waxy. SEQ DD NO: 15 exemplifies the N-terminal sequence amyloplast targeting sequence from 
waxy operably linked to a-amylase. Moreover, the polynucleotide encoding the processing 
enzyme may be fused to target starch granules using the waxy starch binding domain. For 
example, SEQ ED NO: 16 exemplifies a fusion polypeptide comprising the N-terminal amyloplast 
targeting sequence from waxy operably linked to an a-amylase/waxy fusion polypeptide 
comprising the waxy starch binding domain. 

The polynucleotides of the present invention, in addition to processing signals, may 
further include other regulatory sequences, as is known in the art. "Regulatory sequences" and 
"suitable regulatory sequences" each refer to nucleotide sequences located upstream (5 1 non- 
coding sequences), within, or downstream (3* non-coding sequences) of a coding sequence, and 
which influence the transcription, RNA processing or stability, or translation of the associated 
coding sequence. Regulatory sequences include enhancers, promoters, translation leader 
sequences, introns, and polyadenylation signal sequences. They include natural and synthetic 
sequences as well as sequences, which may be a combination of synthetic and natural sequences. 

Selectable markers may also be used in the present invention to allow for the selection of 
transformed plants and plant tissue, as is well-known in the art. One may desire to employ a 
selectable or screenable marker gene as, or in addition to, the expressible gene of interest. 
''Marker genes" are genes that impart a distinct phenotype to cells expressing the marker gene 
and thus allow such transformed cells to be distinguished from cells that do not have the marker. 
Such genes may encode either a selectable or screenable marker, depending on whether the 
marker confers a trait which one can select for by chemical means, i.e., through the use of a 
selective agent (e.g., a herbicide, antibiotic, or the like), or whether it is simply a trait that one 
can identify through observation or testing, i.e., by screening (e.g., the R-locus trait). Of course, 
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many examples of suitable marker genes are known to the art and can be employed in the 
practice of the invention. 

Included within the terms selectable or screenable marker genes are also genes which 
encode a "secretable marker" whose secretion can be detected as a means of identifying or 
selecting for transformed cells. Examples include markers which encode a secretable antigen that 
can be identified by antibody interaction, or even secretable enzymes which can be detected by 
their catalytic activity. Secretable proteins fall into a number of classes, including small, 
diffusible proteins detectable, e.g., by ELISA; small active enzymes detectable in extracellular 
solution (e.g., a-amylase, p-lactamase, phosphinothricin acetyltransferase); and proteins that are 
inserted or trapped in the cell wall (e.g., proteins that include a leader sequence such as that 
found in the expression unit of extensin or tobacco PR-S). 

With regard to selectable secretable markers, the use of a gene that encodes a protein that 
becomes sequestered in the cell wall, and which protein includes a unique epitope is considered 
to be particularly advantageous. Such a secreted antigen marker would ideally employ an 
epitope sequence that would provide low background in plant tissue, a promoter-leader sequence 
that would impart efficient expression and targeting across the plasma membrane, and would 
produce protein that is bound in the cell wall and yet accessible to antibodies. A normally 
secreted wall protein modified to include a unique epitope would satisfy all such requirements. 

One example of a protein suitable for modification in this manner is extensin, or 
hydroxyproline rich glycoprotein (HPRG). For example, the maize HPRG (Steifel et al., The 
Plant Cell , 2:785 (1990)) molecule is well characterized in terms of molecular biology, 
expression and protein structure. However, any one of a variety of extensins and/or glycine-rich 
wall proteins (Keller et al., EMBQ Journal . 8:1309 (1989)) could be modified by the addition of 
an antigenic site to create a screenable marker. 

a. Selectable Markers 

Possible selectable markers for use in connection with the present invention include, but 
are not limited to, a neo or nptll gene (Potrykus et al., Mol. Gen. Genet ., 199:183 (1985)) which 
codes for kanamycin resistance and can be selected for using kanamycin, G418, and the like; a 
bar gene which confers resistance to the herbicide phosphinothricin; a gene which encodes an 

38 



WO 2005/096804 



PCT/US2004/007182 



altered EPSP synthase protein (Hinchee et al., Biotech ., 6:915 (1988)) thus conferring glyphosate 
resistance; a nitrilase gene such as bxn from Klebsiella ozaenae which confers resistance to 
bromoxynil (Stalker et al., Science , 242:419 (1988)); a mutant acetolactate synthase gene (ALS) 
which confers resistance to imidazolinone, sulfonylurea or other ALS-inhibiting chemicals 
(European Patent Application 154,204, 1985); a methotrexate-resistant DHFR gene (Thillet et 
al., J. Biol. Chem .. 263:12500 (1988)); a dalapon dehalogenase gene that confers resistance to 
the herbicide dalapon; a phosphomannose isomerase (PMI) gene; a mutated anthranilate synthase 
gene that confers resistance to 5-methyl tryptophan; the hph gene which confers resistance to the 
antibiotic hygromycin; or the mannose-6-phosphate isomerase gene (also referred to herein as 
the phosphomannose isomerase gene), which provides the ability to metabolize mannose (U.S. 
Patent Nos. 5,767,378 and 5,994,629). One skilled in the art is capable of selecting a suitable 
selectable marker gene for use in the present invention. Where a mutant EPSP synthase gene is 
employed, additional benefit may be realized through the incorporation of a suitable chloroplast 
transit peptide, CTP (European Patent Application 0,218,571, 1987). 

An illustrative embodiment of a selectable marker gene capable of being used in systems 
to select transformants are the genes that encode the enzyme phosphinothricin acetyltransferase, 
such as the bar gene from Streptomyces hygroscopicus or the pat gene from Streptomyces 
vihdochromogenes. The enzyme phosphinothricin acetyl transferase (PAT) inactivates the 
active ingredient in the herbicide bialaphos, phosphinothricin (PPT). PPT inhibits glutamine 
synthetase, (Murakami et al., Mol. Gen. Genet .. 205:42 (1986); Twell et al., Plant Physiol . 
91:1270 (1989)) causing rapid accumulation of ammonia and cell death. The success in using 
this selective system in conjunction with monocots was particularly surprising because of the 
major difficulties which have been reported in transformation of cereals (Potrykus, Trends 
Biotech., 7:269 (1989)). 

Where one desires to employ a bialaphos resistance gene in the practice of the invention, 

a particularly useful gene for this purpose is the bar or pat genes obtainable from species of 

Streptomyces (e.g., ATCC No. 21 ,705). The cloning of the bar gene has been described 

(Murakami et al., Mol. Gen. Genet ., 205:42 (1986); Thompson et al., EMBO Journal . 6:2519 

(1987)) as has the use of the bar gene in the context of plants other than monocots (De Block et 

al., EMBO Journal , 6:2513 (1987); De Block et al., Plant Physiol ., 91:694 (1989)). 
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h. Screenable Markers 

Screenable markers that may be employed include, but are not limited to, a (J- 
glucuronidase or uidA gene (GUS) which encodes an enzyme for which various chromogeriic 
substrates are known; an R-locus gene, which encodes a product that regulates the production of 
anthocyanin pigments (red color) in plant tissues (Dellaporta et al., in Chromosom e Structure 
and Function , pp. 263-282 (1988)); a p-lactamase gene (Sutcliffe, PNAS USA , 75:3737 (1978)), 
which encodes an enzyme for which various chromogenic substrates are known (e.g., PADAC, a 
chromogcnic cephalosporin); a xylE gene (Zukowsky et ah, PNAS USA , 80:1 101 (1983)) which 
encodes a catechol dioxygenase that can convert chromogenic catechols; an a-amylase gene 
(lkuta et al., Biotech. , 8:241 (1990)); a tyrosinase gene (Katz et al., J. Gen. Microbiol. . 129:2703 
(1983)) which encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone 
which in turn condenses to form the easily detectable compound melanin; a P-galactosidase 
gene, which encodes an enzyme for which there are chromogenic substrates; a luciferase (lux) 
gene (Ow et al, Science , 234.856 (1986)), which allows for bioluminescence detection; or an 
aequorin gene (Prasher et al., Biochem. Biophvs. Res. Comm. . 126: 1 259 (1985)), which may be 
employed in calcium-sensitive bioluminescence detection, or a green fluorescent protein gene 
(Niedz et al., Plant Cell Reports , 14: 403 (1995)). 

Genes from the maize R gene complex are contemplated to be particularly useful as 
screenable markers. The R gene complex in maize encodes a protein that acts to regulate the 
production of anthocyanin pigments in most seed and plant tissue. A gene from the R gene 
complex is suitable for maize transformation, because the expression of this gene in transformed 
cells does not harm the cells. Thus, an R gene introduced into such cells will cause the 
expression of a red pigment and, if stably incorporated, can be visually scored as a red sector. If 
a maize line carries dominant allelles for genes encoding the enzymatic intermediates in the 
anthocyanin biosynthetic pathway (C2, Al, A2, Bzl and Bz2), but carries a recessive allele at 
the R locus, transformation of any cell from that line with R will result in red pigment formation. 
Exemplary lines include Wisconsin 22 which contains the rg-Stadler allele and TR1 12, a K55 
derivative which is r-g, b, PI . Alternatively any genotype of maize can be utilized if the CI and 
R alleles are introduced together. A further screenable marker contemplated for use in the 
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present invention is firefly luciferase, encoded by the lux gene. The presence of the lux gene in 
transformed cells may be detected using, for example, X-ray film, scintillation counting, 
fluorescent spectrophotometry, low-light video cameras, photon counting cameras or multiwell 
luminometry. It is also envisioned that this system may be developed for populational screening 
for bioluminescence, such as on tissue culture plates, or even for whole plant screening. 

The polynucleotides used to transform the plant may include, but is not limited to, DNA 
from plant genes and non-plant genes such as those from bacteria, yeasts, animals or viruses. 
The introduced DNA can include modified genes, portions of genes, or chimeric genes, including 
genes from the same or different maize genotype. The term "chimeric gene" or "chimeric DNA" 
is defined as a gene or DNA sequence or segment comprising at least two DNA sequences or 
segments from species which do not combine DNA under natural conditions, or which DNA 
sequences or segments are positioned or linked in a manner which does not normally occur in the 
native genome of the untransformed plant. 

Expression cassettes comprising the polynucleotide encoding a hyperthermophilic 
processing enzyme, and preferably a codon-optimized polynucleotide is further provided. It is 
preferred that the polynucleotide in the expression cassette (the first polynucleotide) is operably 
linked to regulatory sequences, such as a promoter, an enhancer, an intron, a termination 
sequence, or any combination thereof, and, optionally, to a second polynucleotide encoding a 
signal sequence (N- or C-terminal) which directs the enzyme encoded by the first polynucleotide 
to a particular cellular or subcellular location. Thus, a promoter and one or more signal 
sequences can provide for high levels of expression of the enzyme in particular locations in a 
plant, plant tissue or plant cell. Promoters can be constitutive promoters, inducible (conditional) 
promoters or tissue-specific promoters, e.g., endosperm-specific promoters such as the maize 7- 
zein promoter (exemplified by SEQ ID NO: 12) or the maize ADP-gpp promoter (exemplified by 
SEQ ID NO: 1 1, which includes a 5' untranslated and an intron sequence). The invention also 
provides an isolated polynucleotide comprising a promoter comprising SEQ ED NO:l 1 or 12, a 
polynucleotide which hybridizes to the complement thereof under low stringency hybridization 
conditions, or a fragment thereof which has promoter activity, e.g., at least 10%, and preferably 
at least 50%, the activity of a promoter having SEQ ID NO:l 1 or 12. Also provided are vectors 
which comprise the expression cassette or polynucleotide of the invention and transformed cells 
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comprising the polynucleotide, expression cassette or vector of the invention. A vector of the 
invention can comprise a polynucleotide sequence which encodes more than one 
hyperthcrmophilic processing enzyme of the invention, which sequence can be in sense or 
anti sense orientation, and a transformed cell may comprise one or more vectors of the invention. 
Preferred vectors are those useful to introduce nucleic acids into plant cells. 

Transformation 

The expression cassette, or a vector construct containing the expression cassette may be 
inserted into a cell. The expression cassette or vector construct may be carried episomally or 
integrated into the genome of the cell. The transformed cell may then be grown into a transgenic 
plant. Accordingly, the invention provides the products of the transgenic plant. Such products 
may include, but are not limited to, the seeds, fruit, progeny, and products of the progeny of the 
transgenic plant. 

A variety of techniques are available and known to those skilled in the art for 
introduction of constructs into a cellular host. Transformation of bacteria and many eukaryotic 
cells may be accomplished through use of polyethylene glycol, calcium chloride, viral infection, 
phage infection, electroporation and other methods known in the art. Techniques for 
transforming plant cells or tissue include transformation with DNA employing A. tumefaciens or 
A. rhizogenes as the transforming agent, electroporation, DNA injection, microprojectile 
bombardment, particle acceleration, etc. (See, for example, EP 295959 and EP 138341). 

In one embodiment, binary type vectors of Ti and Ri plasmids of Agrobacterium spp. Ti- 
derived vectors are used to transform a wide variety of higher plants, including 
monocotyledonous and dicotyledonous plants, such as soybean, cotton, rape, tobacco, and rice 
(Pacciotti et al. Bio/Technology, 3:241 (1985): Byrne et al. Plant Cell Tis sue and Organ Culture, 
8:3 (1987); Sukhapinda et al. Plant Mol. BioL 8:209 (1987); Lorz et al. Mol. Gen. Genet., 
199:178 (1985); Potrykus Mol. Gen. Genet. , 199:183 (1985); Park et al, J. Plant Biol.. 38*365 
(1985): Hiei et al., Plant J. , 6:271(1994)). The use of T-DNA to transform plant cells has 
received extensive study and is amply described (EP 120516; Hoekema, In: The Binary Plant 
Vector System . Offset-drukkerij Kanters B.V.; Alblasserdam (1985), Chapter V; Knauf, et al., 
Genetic Analysis of Host Range Expression by Agrobacterium In: Molecular G enetics of the 
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Bacteria-Plant Interaction , Puhler, A. ed., Springer-Verlag, New York, 1983, p. 245; and An. et 
aL EMBO J. , 4:277 (1985)). 

Other transformation methods are available to those skilled in the art, such as direct 
uptake of foreign DNA constructs (see EP 295959), techniques of electroporation (Fromrn et al. 
Nature (London) , 319:791 (1986), or high velocity ballistic bombardment with metal particles 
coated with the nucleic acid constructs (Kline et al. Nature (London) 327:70 (1987), and U.S. 
Patent No. 4,945,050). Once transformed, the cells can be regenerated by those skilled in the art. 
Of particular relevance are the recently described methods to transform foreign genes into 
commercially important crops, such as rapeseed (De Block et al., Plant Physiol. 91 :694-701 
(1989)), sunflower (Everett et al., Bio/Technology , 5:1201(1987)), soybean (McCabe et al., 
Bio/Technology , 6:923 (1 988); Hinchee et al., Bio/Technology , 6:915 (1988); Chee et al., Plant 
Physiol . 91:1212 (1989); Christou et al., Proc. Natl. Acad. Sci USA , 86:7500 (1989) EP 
301749), rice (Hiei et al, Plant J. . 6:271 (1994)), and com (Gordon Kamm et al, Plant Cell 
2:603 (1990); Fromrn et al, Biotechnology , 8:833, (1990)). 

Expression vectors containing genomic or synthetic fragments can be introduced into 
protoplasts or into intact tissues or isolated cells. Preferably expression vectors are introduced 
into intact tissue. General methods of culturing plant tissues are provided, for example, by Maki 
et al "Procedures for Introducing Foreign DNA into Plants" in Methods in Plant Molecular 
Biology & Biotechnology, Glich et al. (Eds.), pp. 67-88 CRC Press (1993); and by Phillips et al 
"Cell-Tissue Culture and In-Vitro Manipulation" in Corn & Corn Improvement, 3rd Edition 10, 
Sprague et al (Eds.) pp. 345-387, American Society of Agronomy Inc. (1988). 

In one embodiment, expression vectors may be introduced into maize or other plant 
tissues using a direct gene transfer method such as microprojectile-mediated delivery, DNA 
injection, electroporation and the like. Expression vectors are introduced into plant tissues using 
the microprojectile media delivery with the biolistic device. See, for example, Tomes et al 
"Direct DNA transfer into intact plant cells via microprojectile bombardment" in Gamborg and 
Phillips (Eds.) Plant Cell, Tissue and Organ Culture: Fundamental Methods, Springer Verlag, 
Berlin (1995). Nevertheless, the present invention contemplates the transformation of plants 
with a hyperthermophilic processing enzyme in accord with known transforming methods. Also 
see, Weissinger et al, Annual Rev. Genet. , 22:421 (1988); Sanford et al, Particulate Science and 
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Technology . 5:27 (1987) (onion); Christou et al., Plant PhvsioL 87:671 (1988 )(soybean); 
McCabe et al., Bio/Technology , 6:923 (1988) (soybean); Datta et al., Bio/Technology , 8:736 
(1990) (rice); Klein et al., Proc. Natl. Acad. Sci. USA , 85:4305 (1988 )(maize); Klein et al., 
Bio/Technology , 6:559 (1988)(maize); Klein et al., Plant PhvsioL 91:440 (1988)(maize); Fromm 
et al., Bio/Technology , 8:833 (1990) (maize); and Gordon-Kamm et al., Plant Cell 2, 603 
(1990)(maize); Svab et al., Proc. Natl. Acad. Sci. USA , 87:8526 (1990) (tobacco chloroplast); 
Koziel et al., Biotechnology , U: 194 (1993) (maize); Shimamoto et al., Nature , 338:274 (1989) 
(rice); Christou et al., Biotechnology , 9:957 (1991) (rice); European Patent Application EP 0 332 
581 (orchardgrass and other Pooideae); Vasil et al., Biotechnology , ii:1553 (1993) (wheat); 
Weeks et al., Plant PhvsioL 102:1077 (1993) (wheat). Methods in Molecular Biology, 82. 
Arabidopsis Protocols Ed. Martinez-Zapater and Salinas 1998 Humana Press (Arabidopsis). 

Transformation of plants can be undertaken with a single DNA molecule or multiple 
DNA molecules (i.e., co-transformation), and both these techniques are suitable for use with the 
expression cassettes and constructs of the present invention. Numerous transformation vectors 
are available for plant transformation, and the expression cassettes of this invention can be used 
in conjunction with any such vectors. The selection of vector will depend upon the preferred 
transformation technique and the target species for transformation. 

Ultimately, the most desirable DNA segments for introduction into a monocot genome 
may be homologous genes or gene families which encode a desired trait (e.g., hydrolysis of 
proteins, lipids or polysaccharides) and which are introduced under the control of novel 
promoters or enhancers, etc., or perhaps even homologous or tissue specific (e.g., root-, 
collar/sheath-, whorl-, stalk-, earshank-, kernel- or leaf-specific) promoters or control elements. 
Indeed, it is envisioned that a particular use of the present invention will be the targeting of a 
gene in a constitutive manner or in an inducible manner. 

Examples of Suitable Transformation Vectors 

Numerous transformation vectors available for plant transformation are known to those 
of ordinary skill in the plant transformation arts, and the genes pertinent to this invention can be 
used in conjunction with any such vectors known in the art. The selection of vector will depend 
upon the preferred transformation technique and the target species for transformation. 
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Many vectors are available for transformation using Agrobacterium tumefaciens. These 
typically carry at least one T-DNA border sequence and include vectors such as pBIN19 (Bevan, 
Nucl. Acids Res, (1984)). Below, the construction of two typical vectors suitable for 
Agrobacterium transformation is described. 

pCIB2Q0 and pCIB2001 

The binary vectors pcIB200 and pCEB2001 are used for the construction of recombinant 
vectors for use with Agrobacterium and are constructed in the following manner. pTJS75kan is 
created by Narl digestion of pTJS75 (Schmidhauser & Helinski, J. BacterioL 164: 446 (1985)) 
allowing excision of the tetracycline-resistance gene, followed by insertion of an AccI fragment 
from pUC4K carrying an NPTI1 (Messing & Vierra, Gene , 19: 259 (1982): Bevan et al, Nature, 
304: 184 (1983): McBride et al., Plant Molecular Biology , 14: 266 (1990)). Xhol linkers are 
ligated to the EcoRV fragment of PCIB7 which contains the left and right T-DNA borders, a 
plant selectable nos/nptll chimeric gene and the pUC polylinker (Rothstein et al., Gene , 53: 1 53 
(1987)), and the Xhol-digested fragment are cloned into Sall-digested pTJS75kan to create 
pC!B200 (see also EP 0 332 104, example 19). pCB200 contains the following unique 
polylinker restriction sites: EcoRI, SstI, KpnI, Bglll, Xbal, and Sail. pCDB2001 is a derivative of 
pCIB200 created by the insertion into the polylinker of additional restriction sites. Unique 
restriction sites in the polylinker of pCIB2001 are EcoRJ, SstI, KpnI, Bglll, Xbal, Sail, Mlul, 
Bell, Avrll, Apal, Hpal, and Stul. pC!B2001, in addition to containing these unique restriction 
sites also has plant and bacterial kanamycin selection, left and right T-DNA borders for 
Agrobacterium-mcdiaXed transformation, the RK2 -derived trfA function for mobilization 
between E. coli and other hosts, and the O/Tand OriV functions also from RK2. The pCIB2001 
polylinker is suitable for the cloning of plant expression cassettes containing their own 
regulatory signals, 
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pCIBlO and Hygrorovcin Selection Derivatives thereof : 

The binary vector pCEB 1 0 contains a gene encoding kanamycin resistance for selection in 
plants and T-DNA right and left border sequences and incorporates sequences from the wide 
host-range plasmid pRK252 allowing it to replicate in both E. coli and Agrobacterium. Its 
construction is described by Rothstein et al. ( Gene , 53: 153 (1987)). Various derivatives of 
pCIBlO are constructed which incorporate the gene for hygromycin B phosphotransferase 
described by Gritz et al. ( Gene , 25: 179 (1983)). These derivatives enable selection of transgenic 
plant cells on hygromycin only (pCEB743), or hygromycin and kanamycin (pCEB715, pCIB717). 

b. Vectors Suitable for non~Azrobacterium Transformation 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transformation vector and consequently vectors 
lacking these sequences can be utilized in addition to vectors such as the ones described above 
which contain T-DNA sequences. Transformation techniques that do not rely on Agrobacterium 
include transformation via particle bombardment, protoplast uptake (e.g., PEG and 
electroporation) and microinjection. The choice of vector depends largely on the preferred 
selection for the species being transformed. Non-limiting examples of the construction of typical 
vectors suitable for non- Agrobacterium transformation is further described. 

PCIB3064 

pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in 
combination with selection by the herbicide basta (or phosphinothricin). The plasmid pCIB246 
comprises the CaMV 35S promoter in operational fusion to the E. coli GUS gene and the CaMV 
35S transcriptional terminator and is described in the PCT published application WO 93/07278. 
The 35S promoter of this vector contains two ATG sequences 5' of the start site. These sites are 
mutated using standard PGR techniques in such a way as to remove the ATGs and generate the 
restriction sites Sspl and PvuII. The new restriction sites are 96 and 37 bp away from the unique 
Sail site and 101 and 42 bp away from the actual start site. The resultant derivative of pCIB246 
is designated pCIB3025. The GUS gene is then excised from pCEB3025 by digestion with Sail 
and Sad, the termini rendered blunt and religated to generate plasmid pCEB3060. The plasmid 
pJIT82 may be obtained from the John Innes Centre, Norwich and the a 400 bp Smal fragment 
containing the bar gene from Streptomyces viridochromogenes is excised and inserted into the 
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Hpal site of pCIB3060 (Thompson et al., EMBO J , 6: 2519 (1987)). This generated pCEB3064, 
which comprises the bar gene under the control of the CaMV 35S promoter and terminator for 
herbicide selection, a gene for ampicillin resistance (for selection in E. coli) and a polylinker 
with the unique sites Sph], PstI, Hindlll, and BamHI. This vector is suitable for the cloning of 
plant expression cassettes containing their own regulatory signals. 
pSQG19and pSOG35 : 

The plasmid pSOG35 is a transformation vector that utilizes the E. coli gene 
dihydrofolate reductase (DHFR) as a selectable marker conferring resistance to methotrexate. 
PCR is used to amplify the 35S promoter (-800 bp), intron 6 from the maize Adhl gene (-550 
bp) and 18 bp of the GUS untranslated leader sequence from pSOGlO. A 250-bp fragment 
encoding the E. coli dihydrofolate reductase type II gene is also amplified by PCR and these two 
PCR fragments are assembled with a SacI-PstI fragment from pB1221 (Clontech) which 
comprises the pUC19 vector backbone and the nopaline synthase terminator. Assembly of these 
fragments generates pSOG 19 which contains the 35S promoter in fusion with the intron 6 
sequence, the GUS leader, the DHFR gene and the nopaline synthase terminator. Replacement 
of the GUS leader in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus 
(MCMV) generates the vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for 
ampicillin resistance and have Hindlll, SphI, PstI and EcoRI sites available for the cloning of 
foreign substances. 

c. Vector Suitable for Chloroplast Transformation 

For expression of a nucleotide sequence of the present invention in plant plastids, plastid 
transformation vector pPH 1 43 (WO 97/3201 1 , example 36) is used. The nucleotide sequence is 
inserted into pPH143 thereby replacing the PROTOX coding sequence. This vector is then used 
for plastid transformation and selection of transformants for spectinomycin resistance. 
Alternatively, the nucleotide sequence is inserted in pPH143 so that it replaces the aadH gene. In 
this case, transformants are selected for resistance to PROTOX inhibitors. 

Plant Hosts Subject to Transformation Methods 

Any plant tissue capable of subsequent clonal propagation, whether by organogenesis 
or embryogenesis, may be transformed with a construct of the present invention. The term 
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organogenesis means a process by which shoots and roots are developed sequentially from 
meristematic centers while the term embryogenesis means a process by which shoots and roots 
develop together in a concerted fashion (not sequentially), whether from somatic cells or 
gametes. The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary tissue 
targets include differentiated and undifferentiated tissues or plants, including but not limited to 
leaf disks, roots, stems, shoots, leaves, pollen, seeds, embryos, cotyledons, hypocotyls, 
megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristems, axillary 
buds, and root meristems), and induced meristem tissue (e.g., cotyledon meristem and hypocotyl 
meristem), tumor tissue, and various forms of cells and culture such as single cells, protoplast, 
embryos, and callus tissue. The plant tissue may be in plants or in organ, tissue or cell culture. 

Plants of the present invention may take a variety of forms. The plants may be chimeras 
of transformed cells and non-transformed cells; the plants may be clonal transformants (e.g., all 
cells transformed to contain the expression cassette); the plants may comprise grafts of 
transformed and untransformed tissues (e.g., a transformed root stock grafted to an 
untransformed scion in citrus species). The transformed plants may be propagated by a variety 
of means, such as by clonal propagation or classical breeding techniques. For example, first 
generation (or Tl) transformed plants may be selfed to give homozygous second generation (or 
T2) transformed plants, and the T2 plants further propagated through classical breeding 
techniques. A dominant selectable marker (such as npt D) can be associated with the expression 
cassette to assist in breeding. 

The present invention may be used for transformation of any plant species, including 
monocots or dicots, including, but not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, 
B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa 
(Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, 
Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum 
miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower 
(Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean 
(Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis 
hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea 
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batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), pineapple 
(Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), 
banana (Musa spp.), avocado (Persea americana), fig (Ficus casica), guava (Psidium guajava), 
mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium 
occidental), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets 
(Bela vulgaris), sugarcane (Saccharum spp.), oats, barley, vegetables, ornamentals, woody plants 
such as conifers and deciduous trees, squash, pumpkin, hemp, zucchini, apple, pear, quince, 
melon, plum, cherry, peach, nectarine, apricot, strawberry, grape, raspberry, blackberry, soybean, 
sorghum, sugarcane, rapeseed, clover, carrot, and Arabidopsis thaliana. 

Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), 
green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), 
cauliflower, broccoli, turnip, radish, spinach, asparagus, onion, garlic, pepper, celery, and 
members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), 
and musk melon (C melo), Ornamentals include azalea (Rhododendron spp.), hydrangea 
(Macrophylla hydrangea), hibiscus (Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa 
spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), 
poinsettia (Euphorbia pulcherrima), and chrysanthemum. Conifers that may be employed in 
practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), 
slash pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), 
and Monterey pine (Pinus radiata), Douglas-fir (Pseudotsuga menziesii); Western hemlock 
(Tsuga canadensis); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such 
as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red 
cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis nootkatensis). Leguminous 
plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden 
beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc. Legumes include, but are 
not limited to, Arachis, e.g., peanuts, Vicia, e.g., crown vetch, hairy vetch, adzuki bean, mung 
bean, and chickpea, Lupinus, e.g., lupine, trifolium, Phaseolus, e.g., common bean and lima 
bean, Pisum, e.g., field bean, Melilotus, e.g., clover, Medicago, e.g., alfalfa, Lotus, e.g., trefoil, 
lens, e.g., lentil, and false indigo. Preferred forage and turf grass for use in the methods of the 
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invention include alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, and 
redtop. 

Preferably, plants of the present invention include crop plants, for example, corn, alfalfa, 
sunflower, Brassica, soybean, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, barley, 
rice, tomato, potato, squash, melons, legume crops, etc. Other preferred plants include Liliopsida 
and Panicoideae. 

Once a desired DN A sequence has been transformed into a particular plant species, it 
may be propagated in that species or moved into other varieties of the same species, particularly 
including commercial varieties, using traditional breeding techniques. 

Below are descriptions of representative techniques for transforming both dicotyledonous 
and monocotyledonous plants, as well as a representative plastid transformation technique. 

a. Transformation of Dicotyledons 

Transformation techniques for dicotyledons are well known in the art and include 
Agrobacterium-based techniques and techniques that do not require Agrobactehum. Non- 
Agrobacterium techniques involve the uptake of exogenous genetic material directly by 
protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, 
particle bombardment-mediated delivery, or microinjection. Examples of these techniques are 
described by Paszkowski et al., EMBO J , 3: 2717 (1984), Potrykus et a!., Mol. Gen. Genet., 199: 
169 (1985), Reich et al., Biotechnology , 4: 1001 (1986), and Klein et al., Nature , 327: 70 (1987). 
In each case the transformed cells are regenerated to whole plants using standard techniques 
known in the art. 

Agrobacterium-medxaXed transformation is a preferred technique for transformation of 

dicotyledons because of its high efficiency of transformation and its broad utility with many 

different species. Agrobacterium transformation typically involves the transfer of the binary 

vector carr>nng the foreign DNA of interest (e.g. pCJB200 or pCB2001) to an appropriate 

Agrobacterium strain which may depend on the complement of vir genes carried by the host 

Agrobacterium strain either on a co-resident Ti plasmid or chromosomally (e.g., strain CIB542 

for pCrB200 and pCD32001 (Uknes et al., Plant Cell , 5: 159 (1993)). The transfer of the 

recombinant binary vector to Agrobacterium is accomplished by a triparental mating procedure 

using E. coli carrying the recombinant binary vector, a helper E. coli strain which carries a 
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plasmid such as pRK2013 and which is able to mobilize the recombinant binary vector to the 
target Agrobacterium strain. Alternatively, the recombinant binary vector can be transferred to 
Agrobactehum by DNA transformation (Hdfgen & Willmitzer, Nucl. Acids Res. , 16: 9877 
(1988)). 

Transformation of the target plant species by recombinant Agrobacterium usually involves 
co-cultivation of the Agrobacterium with explants from the plant and follows protocols well 
known in the art. Transformed tissue is regenerated on selectable medium carrying the antibiotic 
or herbicide resistance marker present between the binary plasmid T-DNA borders. 

The vectors may be introduced to plant cells in known ways. Preferred cells for 
transformation include Agrobacterium, monocot cells and dicots cells, including Liliopsida cells 
and Panicoideae cells. Preferred monocot cells are cereal cells, e.g., maize (corn), barley, and 
wheat, and starch accumulating dicot cells, e.g., potato. 

Another approach to transforming a plant cell with a gene involves propelling inert or 
biologically active particles at plant tissues and cells. This technique is disclosed in U.S. Patent 
Nos. 4,945,050, 5,036,006, and 5,100,792. Generally, this procedure involves propelling inert or 
biologically active particles at the cells under conditions effective to penetrate the outer surface 
of the cell and afford incorporation within the interior thereof. When inert particles are utilized, 
the vector can be introduced into the cell by coating the particles with the vector containing the 
desired gene. Alternatively, the target cell can be surrounded by the vector so that the vector is 
carried into the cell by the wake of the particle. Biologically active particles (e.g., dried yeast 
cells, dried bacterium or a bacteriophage, each containing DNA sought to be introduced) can also 
be propelled into plant cell tissue. 

b. Transformation of Monocotyledons 

Transformation of most monocotyledon species has now also become routine. Preferred 

techniques include direct gene transfer into protoplasts using polyethylene glycol (PEG) or 

electroporation techniques, and particle bombardment into callus tissue. Transformations can be 

undertaken with a single DNA species or multiple DNA species (i.e., co-tTansformation) and 

both these techniques are suitable for use with this invention. Co-transformation may have the 

advantage of avoiding complete vector construction and of generating transgenic plants with 

unlinked loci for the gene of interest and the selectable marker, enabling the removal of the 
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selectable marker in subsequent generations, should this be regarded desirable. However, a 
disadvantage of the use of co-transformation is the less than 100% frequency with which 
separate DNA species are integrated into the genome (Schocher et al., Biotechnology . 4: 1093 
1986)). 

Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe techniques 
for the preparation of callus and protoplasts from an elite inbred line of maize, transformation of 
protoplasts using PEG or electroporation, and the regeneration of maize plants from transformed 
protoplasts. Gordon-Kamm et al. ( Plant Cell 2: 603 (1990)) and Fromm et al. ( Biotechnology , 
8: 833 (1990)) have published techniques for transformation of A188-derived maize line using 
particle bombardment. Furthermore, WO 93/07278 and Koziel et al. ( Biotechnology , U; 194 
(1993)) describe techniques for the transformation of elite inbred lines of maize by particle 
bombardment. This technique utilizes immature maize embryos of 1.5-2.5 mm length excised 
from a maize ear 14-15 days after pollination and a PDS-lOOOHe Biolistics device for 
bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques utilizing 
protoplasts or particle bombardment. Protoplast-mediated transformation has been described for 
Japonica-types and Indica-lypes (Zhang et al., Plant Cell Rep , 7: 379 (1988); Shimamoto et al., 
Nature , 338: 274 (1989); Datta et al., Biotechnology , 8: 736 (1990)). Both types are also 
routinely transformable using particle bombardment (Christou et al., Biotechnology , 9: 957 
(1991)). Furthermore, WO 93/21335 describes techniques for the transformation of rice via 
electroporation. Patent Application EP 0 332 581 describes techniques for the generation, 
transformation and regeneration of Pooideae protoplasts. These techniques allow the 
transformation of Dactylis and wheat. Furthermore, wheat transformation has been described by 
Vasil et al. ( Biotechnology , J_0: 667 (1992)) using particle bombardment into cells of type C 
long-term regenerable callus, and also by Vasil et al. ( Biotechnology , H- 1553 (1993)) and 
Weeks et al. ( Plant Physiol. , 102 : 1077 (1993)) using particle bombardment of immature 
embryos and immature embryo-derived callus. A preferred technique for wheat transformation, 
however, involves the transformation of wheat by particle bombardment of immature embryos 
and includes either a high sucrose or a high maltose step prior to gene delivery. Prior to 
bombardment, any number of embryos (0.75-1 mm in length) are plated onto MS medium with 
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3% sucrose (Murashiga & Skoog, Phvsiologia Plantarum , 15: 473 (1962)) and 3 mg/I 2,4-D for 
induction of somatic embryos, which is allowed to proceed in the dark. On the chosen day of 
bombardment, embryos are removed from the induction medium and placed onto the osmoticum 
(i.e., induction medium with sucrose or maltose added at the desired concentration, typically 
1 5%). The embryos are allowed to plasmolyze for 2-3 hours and are then bombarded. Twenty 
embryos per target plate is typical, although not critical. An appropriate gene-carrying plasmid 
(such as pCIB3064 or pSG35) is precipitated onto micrometer size gold particles using standard 
procedures. Each plate of embryos is shot with the DuPont Biolistics® helium device using a 
burst pressure of about 1000 psi using a standard 80 mesh screen. After bombardment, the 
embryos are placed back into the dark to recover for about 24 hours (still on osmoticum). After 
24 hours, the embryos are removed from the osmoticum and placed back onto induction medium 
where they stay for about a month before regeneration. Approximately one month later the 
embryo explants with developing embryogenic callus are transferred to regeneration medium 
(MS + 1 mg/liter NAA, 5 mg/liter GA), further containing the appropriate selection agent (10 
mg/I basta in the case of pCIB3064 and 2 mg/1 methotrexate in the case of pSOG35). After 
approximately one month, developed shoots are transferred to larger sterile containers known as 
"GA7s" which contain half-strength MS, 2% sucrose, and the same concentration of selection 
agent. 

Transformation of monocotyledons using Agrobacterium has also been described. See, 
WO 94/00977 and U.S. Patent No. 5,591,616, both of which are incorporated herein by 

reference. 

c. Transformation of Plastids 

Seeds o{Nico(iana tabacum c.v. 'Xanthi nc' are germinated seven per plate in a 1" 
circular array on T agar medium and bombarded 12-14 days after sowing with 1 Jim tungsten 
particles (M10, Biorad, Hercules, CA) coated with DNA from plasmids pPH143 and pPH145 
essentially as described (Svab and Maliga, PNAS. 90:913 (1993)). Bombarded seedlings are 
incubated on T medium for two days after which leaves are excised and placed abaxial side up in 
bright light (350-500 nmol photons/m 2 /s) on plates of RMOP medium (Svab, Hajdukiewicz and 
Maliga, PNAS , 87:8526 (1990)) containing 500 jig/ml spectinomycin dihydrochloride (Sigma, 
St. Louis, MO). Resistant shoots appearing underneath the bleached leaves three to eight weeks 
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after bombardment are subcloned onto the same selective medium, allowed to form callus, and 
secondary shoots isolated and subcloned. Complete segregation of transformed plastid genome 
copies (homoplasmicity) in independent subclones is assessed by standard techniques of 
Southern blotting (Sambrook et al., Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory, Cold Spring Harbor (1989)). BamHI/EcoRl-digested total cellular DNA 
(Mettler, I. J. Plant Mol Biol Reporter , 5:346 (1987)) is separated on 1% Tris-borate (TBE) 

32 

agarose gels, transferred to nylon membranes (Amersham) and probed with P-labeled random 
primed DNA sequences corresponding to a 0.7 kb BamHI/Hindlll DNA fragment from pC8 
containing a portion of the rps7/12 plastid targeting sequence. Homoplasmic shoots are rooted 
aseptically on spectinomycin-containing MS/TBA medium (McBride et al., PNAS, 21:7301 
(1994)) and transferred to the greenhouse. 

Production and Characterization of Stably Transformed Plants 

Transformed plant cells are then placed in an appropriate selective medium for selection 
of transgenic cells, which are then grown to callus. Shoots are grown from callus and plantlets 
generated from the shoot by growing in rooting medium. The various constructs normally will 
be joined to a marker for selection in plant cells. Conveniently, the marker may be resistance to 
a biocide (particularly an antibiotic, such as kanamycin, G418, bleomycin, hygromycin, 
chloramphenicol, herbicide, or the like). The particular marker used will allow for selection of 
transformed cells as compared to cells lacking the DNA which has been introduced. 
Components of DNA constructs, including transcription/expression cassettes of this invention, 
may be prepared from sequences, which are native (endogenous) or foreign (exogenous) to the 
host. By "foreign" it is meant that the sequence is not found in the wild-type host into which the 
construct is introduced. Heterologous constructs will contain at least one region, which is not 
native to the gene from which the transcription-initiation-region is derived. 

To confirm the presence of the transgenes in transgenic cells and plants, a Southern blot 
analysis can be performed using methods known to those skilled in the art. Integration of a 
polynucleic acid segment into the genome can be detected and quantitated by Southern blot, 
since they can be readily distinguished from constructs containing the segments through use of 
appropriate restriction enzymes. Expression products of the transgenes can be detected in any of 
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a variety of ways, depending upon the nature of the product, and include Western blot and 
enzyme assay. One particularly useful way to quantitate protein expression and to detect 
replication in different plant tissues is to use a reporter gene, such as GUS. Once transgenic 
plants have been obtained, they may be grown to produce plant tissues or parts having the 
desired phenotype. The plant tissue or plant parts may be harvested, and/or the seed collected. 
The seed may serve as a source for growing additional plants with tissues or parts having the 
desired characteristics. 

The invention thus provides a transformed plant or plant part, such as an ear, seed, fruit, 
grain, stover, chaff, or bagasse comprising at least one polynucleotide, expression cassette or 
vector of the invention, methods of making such a plant and methods of using such a plant or a 
part thereof. The transformed plant or plant part expresses a processing enzyme, optionally 
localized in a particular cellular or subcellular compartment of a certain tissue or in developing 
grain. For instance, the invention provides a transformed plant part comprising at least one 
starch processing enzyme present in the cells of the plant, wherein the plant part is obtained from 
a transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one starch processing enzyme. The processing enzyme does not act on the target 
substrate unless activated by methods such as heating, grinding, or other methods, which allow 
the enzyme to contact the substrate under conditions where the enzyme is active 

Exemplary Methods of the Present Invention 

The self-processing plants and plant parts of the present invention may be used in various 
methods employing the processing enzymes (mesophilic, thermophilic, or hyperthermophilic) 
expressed and activated therein. In accordance with the present invention, a transgenic plant part 
obtained from a transgenic plant the genome of which is augmented with at least one processing 
enzyme, is placed under conditions in which the processing enzyme is expressed and activated. 
Upon activation, the processing enzyme is activated and functions to act on the substrate in 
which it normally acts to obtained the desired result. For example, the starch-processing 
enzymes act upon starch to degrade, hydrolyze, isomerize, or otherwise modify to obtain the 
desired result upon activation. Non-starch processing enzymes may be used to disrupt the plant 
cell membrane in order to facilitate the extraction of starch, lipids, amino acids, or other products 
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from the plants. Moreover, non-hyperthennophilic and hyperthermophilic enzymes may be used 
in combination in the self-processing plant or plant parts of the present invention. For example, 
a mesophilic non-starch degrading enzyme may be activated to disrupt the plant cell membrane 
for starch extraction, and subsequently, a hyperthermophilic starch-degrading enzyme may then 
be activated in the self-processing plant to degrade the starch. 

Enzymes expressed in grain can be activated by placing the plant or plant part containing 
them in conditions in which their activity is promoted. For example, one or more of the 
following techniques may be used: The plant part may be contacted with water, which provides 
a substrate for a hydrolytic enzyme and thus will activate the enzyme. The plant part may be 
contacted with water which will allow enzyme to migrate from the compartment into which it 
was deposited during development of the plant part and thus to associate with its substrate. 
Movement of the enzyme is possible because compartmentalization is breached during 
maturation, drying of grain and re-hydration. The intact or cracked grain may be contacted with 
water which will allow enzyme to migrate from the compartment into which it was deposited 
during development of the plant part and thus to associate with its substrate. Enzymes can also 
be activated by addition of an activating compound. For example, a calcium-dependent enzyme 
can be activated by addition of calcium. Other activating compounds may determined by those 
skilled in the art. Enzymes can be activated by removal of an inactivator. For example, there are 
known peptide inhibitors of amylase enzymes, the amylase could be co-expressed with an 
amylase inhibitor and then activated by addition of a protease. Enzymes can be activated by 
alteration of pH to one at which the enzyme is most active. Enzymes can also be activated by 
increasing temperature. An enzyme generally increases in activity up to the maximal 
temperature for that enzyme. A mesophilic enzyme will increase in activity from the level of 
activity ambient temperature up to the temperature at which it loses activity which is typically 
less than or equal to 70 °C. Similarly thermophilic and hyperthermophilic enzymes can also be 
activated by increasing temperature. Thermophilic enzymes can be activated by heating to 
temperatures up to the maximal temperature of activity or of stability. For a thermophilic 
enzyme the maximal temperatures of stability and activity will generally be between 70 and 85 
°C. Hyperthermophilic enzymes will have the even greater relative activation than mesophilic or 
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thermophilic enzymes because of the greater potential change in temperature from 25 °C up to 
85 °C to 95 °C or even 100 °C. The increased temperature may be achieved by any method, for 
example by heating such as by baking, boiling, heating, steaming, electrical discharge or any 
combination thereof Moreover, in plants expressing mesophilic or thermophilic enzyme(s), 
activation of the enzyme may be accomplished by grinding, thereby allowing the enzyme to 
contact the substrate. 

The optimal conditions, e.g., temperature, hydration, pH, etc, may be determined by one 
having skill in the art and may depend upon the individual enzyme being employed and the 
desired application of the enzyme. 

The present invention further provides for the use of exogenous enzymes that may assist 
in a particular process. For example, the use of a self-processing plant or plant part of the 
present invention may be used in combination with an exogenously provided enzyme to facilitate 
the reaction. As an example, transgenic a-amylase corn may be used in combination with other 
starch-processing enzymes, such as pullulanase, a-glucosidase, glucose isomerase, mannanases, 
hemicellulases, etc., to hydrolyze starch or produce ethanol. In fact, it has been found that 
combinations of the transgenic a-amylase corn with such enzymes has unexpectedly provided 
superior degrees of starch conversion relative to the use of transgenic a-amylase corn alone. 

Example of suitable methods contemplated herein are provided. 

a. Starch Extraction From Plants 

The invention provides for a method of facilitating the extraction of starch from plants. 
In particular, at least one polynucleotide encoding a processing enzyme that disrupts the 
physically restraining matrix of the endosperm (cell walls, non-starch polysaccharide, and 
protein matrix) is introduced to a plant so that the enzyme is preferably in close physical 
proximity to starch granules in the plant. In this embodiment of the invention, transformed 
plants express one or more protease, glucanase, xylanase, thioredoxin/thioredoxin reductase, 
cellulase, phytase, lipase, beta glucosidase, esterase and the like, but not enzymes that have any 
starch degrading activity, so as to maintain the integrity of the starch granules. The expression of 
these enzymes in a plant part such as grain thus improves the process characteristics of grain. 
The processing enzyme may be mesophilic, thermophilic, or hyperthermophilic. In one example, 
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grain from a transformed plant of the invention is heat dried, likely inactivating non- 
hyperthenmophilic processing enzymes and improving seed integrity. Grain (or cracked grain) is 
steeped at low temperatures or high temperatures (where time is of the essence) with high or low 
moisture content or conditions (see Primary Cereal Processing, Gordon and Willm, eds., pp. 319- 
337 (1994), the disclosure of which is incorporated herein), with or without sulphur dioxide. 
Upon reaching elevated temperatures, optionally at certain moisture conditions, the integrity of 
the endosperm matrix is disrupted by activating the enzymes, e.g., proteases, xylanases, phytase 
or glucanases which degrade the proteins and non-starch polysaccharides present in the 
endosperm leaving the starch granule therein intact and more readily recoverable from the 
resulting material. Further, the proteins and non-starch polysaccharides in the effluent are at 
least partially degraded and highly concentrated, and so may be used for improved animal feed, 
food, or as media components for the fermentation of microorganisms. The effluent is 
considered a corn-steep liquor with improved composition. 

Thus, the invention provides a method to prepare starch granules. The method comprises 
treating grain, for example cracked grain, which comprises at least one non-starch processing 
enzyme under conditions which activate the at least one enzyme, yielding a mixture comprising 
starch granules and non-starch degradation products, e.g., digested endosperm matrix products. 
The non-starch processing enzyme may be mesophilic, thermophilic, or hyperthermophilic. 
After activation of the enzyme, the starch granules are separated from the mixture. The grain is 
obtained from a transformed plant, the genome of which comprises (is augmented with) an 
expression cassette encoding the at least one processing enzyme. For example, the processing 
enzyme may be a protease, glucanase, xylanase, phytase, thiroredoxin/thioredoxin reductase, 
esterase cellulase, lipase, or a beta glucosidase. The processing enzyme may be 
hyperthermophilic. The grain can be treated under low or high moisture conditions, in the 
presence or absence of sulfur dioxide. Depending on the activity and expression level of the 
processing enzyme in the grain from the transgenic plant, the transgenic grain may be mixed 
with commodity grain prior to or during processing. Also provided are products obtained by the 
method such as starch, non-starch products and improved steepwater comprising at least one 

additional component. 

b. Starch-Processinfi Methods 
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Transformed plants or plant parts of the present invention may comprise starch-degrading 
enzymes as disclosed herein that degrade starch granules to dextrins, other modified starches, or 
hexoses (e.g., a-amylase, pullulanase, a-glucosidase, glucoamylase, amylopullulanase) or 
convert glucose into fructose (e.g., glucose isomerase). Preferably, the starch-degrading enzyme 
is selected from a-amylase, a-glucosidase, glucoamylase, pullulanase, neopullulanase, 
amylopullulanase, glucose isomerase, and combinations thereof is used to transform the grain. 
Moreover, preferably, the enzyme is operably linked to a promoter and to a signal sequence that 
targets the enzyme to the starch granule, an amyloplast, the apoplast, or the endoplasmic 
reticulum. Most preferably, the enzyme is expressed in the endosperm, and particularly, com 
endosperm, and localized to one or more cellular compartments, or within the starch granule 
itself. The preferred plant part is grain. Preferred plant parts are those from corn, wheat, barley, 
rye, oat, sugar cane, or rice. 

In accordance with one starch-degrading method of the present invention, the 
transformed grain accumulates the starch-degrading enzyme in starch granules, is steeped at 
conventional temperatures of 50°C-60°C, and wet-milled as is known in the art. Preferably, the 
starch-degrading enzyme is hyperthermophilic. Because of sub-cellular targeting of the enzyme 
to the starch granule, or by virtue of the association of the enzyme with the starch granule, by 
contacting the enzyme and starch granule during the wet-milling process at the conventional 
temperatures, the processing enzyme is co-purified with the starch granules to obtain the starch 
granules/enzyme mixture. Subsequent to the recovery of the starch granules/enzyme mixture, 
the enzyme is then activated by providing favorable conditions for the activity of the enzyme. 
For example, the processing may be performed in various conditions of moisture and/or 
temperature to facilitate the partial (in order to make derivatized starches or dextrins) or 
complete hydrolysis of the starch into hexoses. Syrups containing high dextrose or fructose 
equivalents are obtained in this manner. This method effectively reduces the time, energy, and 
enzyme costs and the efficiency with which starch is converted to the corresponding hexose, and 
the efficiency of the production of products, like high sugar steepwater and higher dextrose 
equivalent syrups, are increased. 
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In another embodiment, a plant, or a product of the plant such as a fruit or grain, or flour 
made from the grain that expresses the enzyme is treated to activate the enzyme and convert 
polysaccharides expressed and contained within the plant into sugars. Preferably, the enzyme is 
fused to a signal sequence that targets the enzyme to a starch granule, an amyloplast, the apoplast 
or to the endoplasmic reticulum as disclosed herein. The sugar produced may then be isolated or 
recovered from the plant or the product of the plant. In another embodiment, a processing enzyme 
able to convert polysaccharides into sugars is placed under the control of an inducible promoter 
according to methods known in the art and disclosed herein. The processing enzyme may be 
mesophilic, thermophilic or hyperthermophilic. The plant is grown to a desired stage and the 
promoter is induced causing expression of the enzyme and conversion of the polysaccharides, 
within the plant or product of the plant, to sugars. Preferably the enzyme is operably linked to a 
signal sequence that targets the enzyme to a starch granule, an amyloplast, an apoplast or to the 
endoplasmic reticulum. In another embodiment, a transformed plant is produced that expresses a 
processing enzyme able to convert starch into sugar. The enzyme is fused to a signal sequence that 
targets the enzyme to a starch granule within the plant. Starch is then isolated from the 
transformed plant that contains the enzyme expressed by the transformed plant. The enzyme 
contained in the isolated starch may then be activated to convert the starch into sugar. The enzyme 
may be mesophilic, thermophilic, or hyperthermophilic. Examples of hyperthermophilic enzymes 
able to convert starch to sugar are provided herein. The methods may be used with any plant 
which produces a polysaccharide and that can express an enzyme able to convert a polysaccharide 
into sugars or hydrolyzed starch product such as dextrin, maltooligosaccharide, glucose and/or 
mixtures thereof. 

The invention provides a method to produce dextrins and altered starches from a plant, or 

a product from a plant, that has been transformed with a processing enzyme which hydrolyses 

certain covalent bonds of a polysaccharide to form a polysaccharide derivative. In one 

embodiment, a plant, or a product of the plant such as a fruit or grain, or flour made from the 

grain that expresses the enzyme is placed under conditions sufficient to activate the enzyme and 

convert polysaccharides contained within the plant into polysaccharides of reduced molecular 

weight. Preferably, the enzyme is fused to a signal sequence that targets the enzyme to a starch 

granule, an amyloplast, the apoplast or to the endoplasmic reticulum as disclosed herein. The 
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dextrin or derivative starch produced may then be isolated or recovered from the plant or the 
product of the plant. In another embodiment, a processing enzyme able to convert 
polysaccharides into dextrins or altered starches is placed under the control of an inducible 
promoter according to methods known in the art and disclosed herein. The plant is grown to a 
desired stage and the promoter is induced causing expression of the enzyme and conversion of 
the polysaccharides, within the plant or product of the plant, to dextrins or altered starches. 
Preferably the enzyme is a-amylase, pullulanase, iso or neo-pullulanase and is operably linked to 
a signal sequence that targets the enzyme to a starch granule, an amyloplast, the apoplast or to 
the endoplasmic reticulum. In one embodiment, the enzyme is targeted to the apoplast or to the 
endoreticulum. In yet another embodiment, a transformed plant is produced that expresses an 
enzyme able to convert starch into dextrins or altered starches. The enzyme is fused to a signal 
sequence that targets the enzyme to a starch granule within the plant. Starch is then isolated 
from the transformed plant that contains the enzyme expressed by the transformed plant. The 
enzyme contained in the isolated starch may then be activated under conditions sufficient for 
activation to convert the starch into dextrins or altered starches. Examples of hyperthermophilic 
enzymes, for example, able to convert starch to hydrolyzed starch products are provided herein. 
The methods may be used with any plant which produces a polysaccharide and that can express 
an enzyme able to convert a polysaccharide into sugar. 

In another embodiment, grain from transformed plants of the invention that accumulate 
starch-degrading enzymes that degrade linkages in starch granules to dextrins, modified starches 
or hexose (e.g., a-amylase, pullulanase, a-glucosidase, glucoamylase, amylopullulanase) is 
steeped under conditions favoring the activity of the starch degrading enzyme for various periods 
of time. The resulting mixture may contain high levels of the starch-derived product. The use of 
such grain: 1) eliminates the need to mill the grain, or otherwise process the grain to first obtain 
starch granules, 2) makes the starch more accessible to enzymes by virtue of placing the enzymes 
directly within the endosperm tissue of the grain, and 3) eliminates the need for microbially 
produced starch-hydrolyzing enzymes. Thus, the entire process of wet-milling prior to hexose 
recovery is eliminated by simply heating grain, preferably com grain, in the presence of water to 
allow the enzymes to act on the starch. 
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This process can also be employed for the production of ethanol, high fructose syrups, 
hexose (glucose) containing fermentation media, or any other use of starch that does not require 
the refinement of grain components. 

The invention further provides a method of preparing dextrin, maltooligosaccharides, 
and/or sugar involving treating a plant part comprising starch granules and at least one starch 
processing enzyme under conditions so as to activate the at least one enzyme thereby digesting 
starch granules to form an aqueous solution comprising sugars. The plant part is obtained from a 
transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one processing enzyme. The aqueous solution comprising dextrins, 
maltooligosaccharides, and/or sugar is then collected. In one embodiment, the processing 
enzyme is a-amylase, a-glucosidase, pullulanase, glucoamylase, amylopullulanase, glucose 
isomerase, or any combination thereof. Preferably, the enzyme is hyperthermophilic. In another 
embodiment, the method further comprises isolating the dextrins, maltooligosaccharides, and/or 
sugar. 

c. Improved Corn Varieties 

The invention also provides for the production of improved corn varieties (and 
varieties of other crops) that have normal levels of starch accumulation, and accumulate 
sufficient levels of amylolytic enzyme(s) in their endosperm, or starch accumulating organ, such 
that upon activation of the enzyme contained therein, such as by boiling or heating the plant or a 
part thereof in the case of a hyperthermophilic enzyme, the enzyme(s) is activated and facilitates 
the rapid conversion of the starch into simple sugars. These simple sugars (primarily glucose) 
will provide sweetness to the treated corn. The resulting corn plant is an improved variety for 
dual use as a grain 

producing hybrid and as sweet com. Thus, the invention provides a method to produce 
hyper-sweet corn, comprising treating transformed corn or a part thereof, the genome of 
which is augmented with and expresses in endosperm an expression cassette comprising a 
promoter operably linked to a first polynucleotide encoding at least one amylolytic enzyme, 
conditions which activate the at least one enzyme so as to convert polysaccharides in the corn 
into sugar, yielding hypersweet com. The promoter may be a constitutive promoter, a seed- 
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specific promoter, or an endosperm-specific promoter which is linked to a polynucleotide 
sequence which encodes a processing enzyme such 

as a-amylase, e.g., one comprising SEQ ED NO: 13, 14, or 16. Preferably, the enzyme is 
hyperthermophilic. In one embodiment, the expression cassette further comprises a second 
polynucleotide which encodes a signal sequence operably linked to the enzyme encoded by the 
first polynucleotide. Exemplary signal sequences in this embodiment of the invention direct the 
enzyme to apoplast, the endoplasmic reticulum, a starch granule, or to an amyloplast. The corn 
plant is grown such that the ears with kernels are formed and then the promoter is induced to 
cause the enzyme to be expressed and convert polysaccharide contained within the plant into 
sugar. 

d. Self-Fermenting Plants 

Ln another embodiment of the invention, plants, such as com, rice, wheat, or sugar 
cane are engineered to accumulate large quantities of processing enzymes in their cell walls, e.g., 
xylanases, cellulases, hemicellulases, glucanases, pectinases, lipases, esterases, beta 
glucosidases, phytases, proteases and the like (non-starch polysaccharide degrading enzymes). 
Following the harvesting of the grain component (or sugar in the case of sugar cane), the stover, 
chaff, or bagasse is used as a source of the enzyme, which was targeted for expression and 
accumulation in the cell walls, and as a source of biomass. The stover (or other left-over tissue) 
is used as a feedstock in a process to recover fermentable sugars. The process of obtaining the 
fermentable sugars consists of activating the non-starch polysaccharide degrading enzyme. For 
example, activation may comprise heating the plant tissue in the presence of water for periods of 
time adequate for the hydrolysis of the non-starch polysaccharide into the resulting sugars. Thus, 
this self-processing stover produces the enzymes required for conversion of polysaccharides into 
monosaccharides, essentially at no incremental cost as they are a component of the feedstock. 
Further, the temperature-dependent enzymes have no detrimental effects on plant growth and 
development, and cell wall targeting, even targeting into polysaccharide microfibrils by virtue of 
cellulose/xylose binding domains ftised to the protein, improves the accessibility of the substrate 
to the enzyme. 

Thus, the invention also provides a method of using a transformed plant part comprising 
at least one non-starch polysaccharide processing enzyme in the cell wall of the cells of the plant 
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part. The method comprises treating a transformed plant part comprising at least one non-starch 
polysaccharide processing enzyme under conditions which activate the at least one enzyme 
thereby digesting starch granules to form an aqueous solution comprising sugars, wherein the 
plant part is obtained from a transformed plant, the genome of which is augmented with an 
expression cassette encoding the at least one non-starch polysaccharide processing enzyme; and 
collecting the aqueous solution comprising the sugars. The invention also includes a transformed 
plant or plant part comprising at least one non-starch polysaccharide processing enzyme present 
in the cell or cell wall of the cells of the plant or plant part. The plant part is obtained from a 
transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one non-starch processing enzyme, e.g., a xylanase, cellulase, glucanase, pectinase, 
lipase, esterase, beta glucosidase, phytase, protease or any combination thereof. 
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e. Aoueous Phase High In Protein and Sugar Content 

In yet another embodiment, proteases and lipases are engineered to accumulate in seeds, 
e.g., soybean seeds. After activation of the protease or lipase, such as, for example, by heating, 
these enzymes in the seeds hydrolyze the lipid and storage proteins present in soybeans during 
processing. Soluble products comprising amino acids, which can be used as feed, food or 
fermentation media, and fatty acids, can thus be obtained. Polysaccharides are typically found in 
the insoluble fraction of processed grain. However, by combining polysaccharide degrading 
enzyme expression and accumulation in seeds, proteins and polysaccharides can be hydrolyzed 
and are found in the aqueous phase. For example, zeins from corn and storage protein and non- 
starch polysaccharides from soybean can be solubilized in this manner. Components of the 
aqueous and hydrophobic phases can be easily separated by extraction with organic solvent or 
supercritical carbon dioxide. Thus, what is provided is a method for producing an aqueous 
extract of grain that contains higher levels of protein, amino acids, sugars or saccharides. 

f. Self-Processing Fermentation 

The invention provides a method to produce ethanol, a fermented beverage, or other 
fermentation-derived product(s). The method involves obtaining a plant, or the product or part of a 
plant, or plant derivative such as grain flour, wherein a processing enzyme that converts 
polysaccharides into sugar is expressed, The plant, or product thereof, is treated such that sugar is 
produced by conversion of the polysaccharide as described above. The sugars and other 
components of the plant are then fermented to form ethanol or a fermented beverage, or other 
fermentation-derived products, according to methods known in the art. See, for example, U.S. 
Patent No.: 4,929,452. Briefly the sugar produced by conversion of polysaccharides is incubated 
with yeast under conditions that promote conversion of the sugar into ethanol. A suitable yeast 
includes high alcohol-tolerant and high-sugar tolerant strains of yeast, such as, for example, the 
yeast, S. cerevisiae ATCC No. 20867. This strain was deposited with the American Type Culture 
Collection, Rockville, MD, on Sept. 17, 1987 and assigned ATCC No. 20867. The fermented 
product or fermented beverage may then be distilled to isolate ethanol or a distilled beverage, or 
the fermentation product otherwise recovered. The plant used in this method may be any plant 
that contains a polysaccharide and is able to express an enzyme of the invention. Many such 

plants are disclosed herein. Preferably the plant is one that is grown commercially. More 
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preferably the plant is one that is normally used to produce ethanol or fermented beverages, or 
fermented products, such as, for example, wheat, barley, com, rye, potato, grapes or rice. 

The method comprises treating a plant part comprising at least one polysaccharide 
processing enzyme under conditions to activate the at least one enzyme thereby digesting 
polysaccharide in the plant part to form fermentable sugar. The polysaccharide processing 
enzyme may be mesophilic, thermophilic, or hyperthermophilic. The plant part is obtained from 
a transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one polysaccharide processing enzyme. Plant parts for this embodiment of the invention 
include, but are not limited to, grain, fruit, seed, stalk, wood, vegetable or root. Plants include 
but are not limited to oat, barley, wheat, berry, grape, rye, corn, rice, potato, sugar beet, sugar 
cane, pineapple, grass and tree. The plant part may be combined with commodity grain or other 
commercially available substrates; the source of the substrate for processing may be a source 
other than the self-processing plant. The fermentable sugar is then incubated under conditions 
that promote the conversion of the fermentable sugar into ethanol, e.g., with yeast and/or other 
microbes. Ln an embodiment, the plant part is derived from corn transformed with onamylase, 
which has been found to reduce the amount of time and cost of fermentation. 

It has been found that the amount of residual starch is reduced when transgenic corn 
made in accordance with the present invention expressing a thermostable a-amylase, for 
example, is used in fermentation. This indicates that more starch is solubilized during 
fermentation. The reduced amount of residual starch results in the distillers' grains having 
higher protein content by weight and higher value. Moreover, the fermentation of the transgenic 
corn of the present invention allows the liquefaction process to be performed at a lower pH, 
resulting in savings in the cost of chemicals used to adjust the pH, at a higher temperature, e.g., 
greater than 85°C, preferably, greater than 90°C, more preferabty, 95°C or higher, resulting in 
shorter liquefaction times and more complete solubilization of starch, and reduction of 
liquefaction times, all resulting in efficient fermentation reactions with higher yields of ethanol. 

Moreover, it has been found that contacting conventional plant parts with even a small 

portion of the transgenic plant made in accordance with the present invention may reduce the 

fermentation time and costs associated therewith. As such, the present invention relates to the 

reduction in the fermentation time for plants comprising subjecting a transgenic plant part from a 
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plant comprising a polysaccharide processing enzyme that converts polysaccharides into sugar 
relative to the use of a plant part not comprising the polysaccharide processing enzyme. 

g. Raw Starch Processing Enzymes And Polynucleotides Encoding Them 
A polynucleotide encoding a mesophilic processing enzyme(s) is introduced into a plant 
or plant part. In an embodiment, the polynucleotide of the present invention is a maize- 
optimized polynucleotide such as provided in SEQ ED NOs: 48, 50, and 59, encoding a 
glucoamylase, such as provided in SEQ ED NOs: 47, and 49. In another embodiment, the 
polynucleotide of the present invention is a maize-optimized polynucleotide such as provided in 
SEQ ID NO: 52, encoding an alpha-amylase, such as provided in SEQ ID NO: 51. Moreover, 
fusion products of processing enzymes are further contemplated. In one embodiment, the 
polynucleotide of the present invention is a maize-optimized polynucleotide such as provided in 
SEQ ED NO: 46, encoding an alpha-amylase and glucoamylase fusion, such as provided in SEQ 
ID NO: 45. Combinations of processing enzymes are further envisioned by the present 
invention. For example, a combination of starch-processing enzymes and non-starch processing 
enzymes is contemplated herein. Such combinations of processing enzymes may be obtained by 
employing the use of multiple gene constructs encoding each of the enzymes. Alternatively, the 
individual transgenic plants stably transformed with the enzymes may be crossed by known 
methods to obtain a plant containing both enzymes. Another method includes the use of 
exogenous enzyme(s) with the transgenic plant. 

The source of the starch-processing and non-starch processing enzymes may be isolated 
or derived from any source and the polynucleotides corresponding thereto may be ascertained by 
one having skill in the art. The a-amylase may be derived from Aspergillus (e.g., Aspergillus 
shirousami and Aspergillus niger), Rhizopus (eg., Rhizopus oryzae), and plants such as corn, 
barley, and rice. The glucoamylase may be derived from Aspergillus (e.g., Aspergillus 
shirousami and Aspergillus niger), Rhizopus (eg., Rhizopus oryzae), and Thermoanaerobacter 
(eg., Thermoanaerobacter thermosaccharolyticum). 

In another embodiment of the invention, the polynucleotide encodes a mesophilic starch- 
processing enzyme that is operably linked to a maize-optimized polynucleotide such as provided 
in SEQ ID NO: 54, encoding a raw starch binding domain, such as provided in SEQ ID NO: 53. 
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In another embodiment, a tissue-specific promoter includes the endosperm-specific 
promoters such as the maize 7-zein promoter (exemplified by SEQ ID NO: 12) or the maize 
ADP-gpp promoter (exemplified by SEQ ID NO: 1 1 , which includes a 5' untranslated and an 
intron sequence) or a Q protein promoter (exemplified by SEQ ID NO: 98) or a rice glutelin 
promoter (exemplified by SEQ ID NO: 67) . Thus, the present invention includes an isolated 
polynucleotide comprising a promoter comprising SEQ ED NO: 1 1, 12, 67, or 98, a 
polynucleotide which hybridizes to the complement thereof under low stringency hybridization 
conditions, or a fragment thereof which has promoter activity, e.g., at least 10%, and preferably 
at least 50%, the activity of a promoter having SEQ ID NO: 1 1 , 1 2, 67 or 98. 

In one embodiment, the product from a starch-hydrolysis gene, such as a-amylase, 
glucoamylase, or a-amylase/glucoamylase fusion may be targeted to a particular organelle or 
location such as the endoplasmic reticulum or apoplast, rather than to the cytoplasm. This is 
exemplified by the use of the maize y-zein N-terminal signal sequence (SEQ ID NO: 17), which 
confers apoplast-specific targeting of proteins, and the use of the y-zein N-terminal signal 
sequence (SEQ ID NO: 17) which is operably linked to the processing enzyme that is operably 
linked to the sequence SEKDEL for retention in the endoplasmic reticulum. Directing the 
protein or enzyme to a specific compartment will allow the enzyme to be localized in a manner 
that it will not come into contact with the substrate. In this manner the enzymatic action of the 
enzyme will not occur until the enzyme contacts its substrate. The enzyme can be contacted with 
its substrate by the process of milling (physical disruption of the cell integrity) and hydrating. 
. For example, a mesophilic starch-hydrolyzing enzyme can be targeted to the apoplast or to the 
endoplasmic reticulum and will therefore not come into contact with starch granules in the 
amyloplast, Milling of the grain will disrupt the integrity of the grain and the starch hydrolyzing 
enzyme will then contact the starch granules. In this manner the potential negative effects of co- 
localization of an enzyme and its substrate can be circumvented, 
h. Food Products Without Added Sweetener 

Also provided is a method to produce a sweetened farinaceous food product 
without adding additional sweetener. Examples of farinaceous products include, but are not 
limited to, breakfast food, ready to eat food, baked food, pasta and cereal products such as 
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breakfast cereal. The method comprises treating a plant part comprising at least one starch 
processing enzyme under conditions which activate the starch processing enzyme, thereby 
processing starch granules in the plant part to sugars so as to form a sweetened product, e.g., 
relative to the product produced by processing starch granules from a plant part which does not 
comprise the hyperthermophilic enzyme. Preferably, the starch processing enzyme is 
hyperthermophilic and is activated by heating, such as by baking, boiling, heating, steaming, 
electrical discharge, or any combination thereof. The plant part is obtained from a transformed 
plant, for instance from transformed soybean, rye, oat, barley, wheat, corn, rice or sugar cane, the 
genome of which is augmented with an expression cassette encoding the at least one 
hyperthermophilic starch processing enzyme, e.g., a-amylase, a-glucosidase, glucoamylase, 
pullulanase, glucose isomerase, or any combination thereof. The sweetened product is then 
processed into a farinaceous food product. The invention also provides a farinaceous food 
product, e.g., a cereal food, a breakfast food, a ready to eat food, or a baked food, produced by 
the method. The farinaceous food product may be formed from the sweetened product and 
water, and may contain malt, flavorings, vitamins, minerals, coloring agents or any combination 
thereof. 

The enzyme may be activated to convert polysaccharides contained within the plant 
material into sugar prior to inclusion of the plant material into the cereal product or during the 
processing of the cereal product. Accordingly, polysaccharides contained within the plant 
material may be converted into sugar by activating the material, such as by heating in the case of 
a hyperthermophilic enzyme, prior to inclusion in the farinaceous product. The plant material 
containing sugar produced by conversion of the polysaccharides is then added to the product to 
produce a sweetened product. Alternatively, the polysaccharides may be converted into sugars 
by the enzyme during the processing of the farinaceous product. Examples of processes used to 
make cereal products are well known in the art and include heating, baking, boiling and the like 
as described in U.S. Patent Nos.: 6,183,788; 6,159,530; 6,149,965; 4,988,521 and 5,368,870. 

Briefly, dough may be prepared by blending various dry ingredients together with water 
and cooking to gelatinize the starchy components and to develop a cooked flavor. The cooked 
material can then be mechanically worked to form a cooked dough, such as cereal dough. The 
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dry ingredients may include various additives such as sugars, starch, salt, vitamins, minerals, 
colorings, flavorings, salt and the like. In addition to water, various liquid ingredients such as 
corn (maize) or malt syrup can be added. The farinaceous material may include cereal grains, 
cut grains, grits or flours from wheat, rice, corn, oats, barley, rye, or other cereal grains and 
mixtures thereof from that a transformed plant of the invention. The dough may then be 
processed into a desired shape through a process such as extrusion or stamping and further 
cooked using means such as a James cooker, an oven or an electrical discharge device. 

Further provided is a method to sweeten a starch containing product without 
adding sweetener. The method comprises treating starch comprising at least one 
starch processing enzyme conditions to activate the at least one enzyme thereby digesting the 
starch to form a sugar thereby forming a treated (sweetened) starch, e.g., relative to the product 
produced by treating starch which does not comprise the hyperthermophilic enzyme. The starch 
of the invention is obtained from a transformed plant, the genome of which is augmented with an 
expression cassette encoding the at least one processing enzyme. Enzymes include ct-amylase, 
a-glucosidase, glucoamylase, pullulanase, glucose isomerase, or any combination thereof. The 
enzyme may be hyperthermophilic and activated with heat. Preferred transformed plants include 
com, soybean, rye, oat, barley, wheat, rice and sugar cane. The treated starch is then added to a 
product to produce a sweetened starch containing product, e.g., a farinaceous food product. Also 
provided is a sweetened starch containing product produced by the method. 

The invention further provides a method to sweeten a polysaccharide containing 
fruit or vegetable comprising: treating a fruit or vegetable comprising at least one 
polysaccharide processing enzyme under conditions which activate the at least one enzyme, 
thereby processing the polysaccharide in the fruit or vegetable to form sugar, yielding a 
sweetened fruit or vegetable, e.g., relative to a fruit or vegetable from a plant which does not 
comprise the polysaccharide processing enzyme. The fruit or vegetable of the invention is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one polysaccharide processing enzyme. 
Fruits and vegetables include potato, tomato, banana, squash, pea, and bean. 
Enzymes include a-amylase, a-glucosidase, glucoamylase, pullulanase, glucose 
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isomerase, or any combination thereof. The enzyme may be hyperthermophilic. 
i. Sweetening a polysaccharide containing plant or plant product 
The method involves obtaining a plant that expresses a polysaccharide processing 
enzyme which converts a polysaccharide into a sugar as described above. Accordingly the 
enzyme is expressed in the plant and in the products of the plant, such as in a fruit or vegetable. 
In one embodiment, the enzyme is placed under the control of an inducible promoter such that 
expression of the enzyme may be induced by an external stimulus. Such inducible promoters 
and constructs are well known in the art and are described herein. Expression of the enzyme 
within the plant or product thereof causes polysaccharide contained within the plant or product 
thereof to be converted into sugar and to sweeten the plant or product thereof. In another 
embodiment, the polysaccharide processing enzyme is constitutively expressed. Thus, the plant 
or product thereof may be activated under conditions sufficient to activate the enzyme to convert 
the polysaccharides into sugar through the action of the enzyme to sweeten the plant or product 
thereof As a result, this self-processing of the polysaccharide in the fruit or vegetable to form 
sugar yields a sweetened fruit or vegetable, e.g., relative to a fruit or vegetable from a plant 
which does not comprise the polysaccharide processing enzyme. The fruit or vegetable of the 
invention is obtained from a transformed plant, the genome of which is augmented with an 
expression cassette encoding the at least one polysaccharide processing enzyme. Fruits and 

« 

vegetables include potato, tomato, banana, squash, pea, and bean. Enzymes include oc-amylase, 
a- glucosidase, glucoamylase, pullulanase, glucose isomerase, or any combination thereof. The 
polysaccharide processing enzyme may be hyperthermophilic. 

j, Isolation of starch from transformed grain that cont ains a enzvme which 
disrupts the endosperm matrix 

The invention provides a method to isolate starch from a transformed grain wherein an 
enzyme is expressed that disrupts the endosperm matrix. The method involves obtaining a plant 
that expresses an enzyme which disrupts the endosperm matrix by modification of, for example, 
cell walls, non-starch polysaccharides and/or proteins. Examples of such enzymes include, but 
are not limited to, proteases, glucanases, thioredoxin, thioredoxin reductase, phytases, lipases, 
cellulases, beta glucosidases, xylanases and esterases. Such enzymes do not include any enzyme 
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that exhibits starch-degrading activity so as to maintain the integrity of the starch granules. The 
enzyme may be fused to a signal sequence that targets the enzyme to the starch granule. In one 
embodiment the grain is heat dried to activate the enzyme and inactivate the endogenous 
enzymes contained within the grain. The heat treatment causes activation of the enzyme, which 
acts to disrupt the endosperm matrix which is then easily separated from the starch granules. In 
another embodiment, the grain is steeped at low or high temperature, with high or low moisture 
content, with or without sulfur dioxide. The grain is then heat treated to disrupt the endosperm 
matrix and allow for easy separation of the starch granules. In another embodiment, proper 
temperature and moisture conditions are created to allow proteases to enter into the starch 
granules and degrade proteins contained within the granules. Such treatment would produce 
starch granules with high yield and little contaminating protein. 

k. Syrup having a high sugar equivalent and use of the syrup to produce ethanol 

or a fermented beverage 

The method involves obtaining a plant that expresses a polysaccharide processing 
enzyme which converts a polysaccharide into a sugar as described above. The plant, or product 
thereof, is steeped in an aqueous stream under conditions where the expressed enzyme converts 
polysaccharide contained within the plant, or product thereof, into dextrin, maltooligosaccharide, 
and/or sugar. The aqueous stream containing the dextrin, maltooligosaccharide, and/or sugar 
produced through conversion of the polysaccharide is then separated to produce a syrup having a 
high sugar equivalent. The method may or may not include an additional step of wet-milling the 
plant or product thereof to obtain starch granules. Examples of enzymes that may be used within 
the method include, but are not limited to, a-amylase, glucoamylase, pullulanase and a- 
glucosidase, The enzyme may be hyperthermophilic. Sugars produced according to the method 
include, but are not limited to, hexose, glucose and fructose. Examples of plants that may be 
used with the method include, but are not limited to, corn, wheat or barley. Examples of 
products of a plant that may be used include, but are not limited to, fruit, grain and vegetables. In 
one embodiment, the polysaccharide processing enzyme is placed under the control of an 
inducible promoter. Accordingly, prior to or during the steeping process, the promoter is 
induced to cause expression of the enzyme, which then provides for the conversion of 
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polysaccharide into sugar. Examples of inducible promoters and constructs containing them are 
well known in the art and are provided herein. Thus, where the polysaccharide processing is 
hyperthermophilic, the steeping is performed at a high temperature to activate the 
hyperthermophilic enzyme and inactivate endogenous enzymes found within the plant or product 
thereof. In another embodiment, a hyperthermophilic enzyme able to convert polysaccharide 
into sugar is constitutively expressed. This enzyme may or may not be targeted to a 
compartment within the plant through use of a signal sequence. The plant, or product thereof, is 
steeped under high temperature conditions to cause the conversion of polysaccharides contained 
within the plant into sugar. 

Also provided is a method to produce ethanol or a fermented beverage from syrup having 
a high sugar equivalent. The method involves incubating the syrup with yeast under conditions 
that allow conversion of sugar contained within the syrup into ethanol or a fermented beverage. 
Examples of such fermented beverages include, but are not limited to, beer and wine. 
Fermentation conditions are well known in the art and are described in U.S. Patent No.: 
4,929,452 and herein. Preferably the yeast is a high alcohol-tolerant and high-sugar tolerant strain 
of yeast such as S. cerevisiae ATCC No. 20867. The fermented product or fermented beverage 
may be distilled to isolate ethanol or a distilled beverage. 

I, Accumulation of hyperthermophilic enzyme in the cell wall of a plant 
The invention provides a method to accumulate a hyperthermophilic enzyme in the cell 
wall of a plant. The method involves expressing within a plant a hyperthermophilic enzyme that 
is fused to a cell wall targeting signal such that the targeted enzyme accumulates in the cell wall. 
Preferably the enzyme is able to convert polysaccharides into monosaccharides. Examples of 
targeting sequences include, but are not limited to, a cellulose or xylose binding domain. 
Examples of hyperthermophilic enzymes include those listed in SEQ ID NO: 1, 3, 5, 10, 13, 14, 
1 5 or 1 6. Plant material containing cell walls may be added as a source of desired enzymes in a 
process to recover sugars from the feedstock or as a source of enzymes for the conversion of 
polysaccharides originating from other sources to monosaccharides. Additionally, the cell walls 
may serve as a source from which enzymes may be purified. Methods to purify enzymes are 
well known in the art and include, but are not limited to, gel filtration, ion-exchange 

chromatography, chromatofocusing, isoelectric focusing, affinity chromatography, FPLC, 
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HPLC, salt precipitation, dialysis, and the like. Accordingly, the invention also provides purified 
enzymes isolated from the cell walls of plants. 

m. Method of preparing and isolating processing enzymes 

In accordance with the present invention, recombinantly-produced processing enzymes of 
the present invention may be prepared by transforming plant tissue or plant cell comprising the 
processing enzyme of the present invention capable of being activated in the plant, selected for 
the transformed plant tissue or cell, growing the transformed plant tissue or cell into a 
transformed plant, and isolating the processing enzyme from the transformed plant or part 
thereof. The recombinantly-produced enzyme may be an a-amylase, glucoamylase, glucose 
isomerase, ct-glucosidase, pullulinase, xylanase, protease, glucanase, beta glucosidase, esterase, 
lipase, or phytase. The enzyme may be encoded by the polynucleotide selected from any of 
SEQ ID NO: 2, 4, 6, 9, 19, 21, 25, 37, 39, 41, 43, 46, 48, 50, 52, 59, 61, 63, 65, 79, 81, 83, 85, 
87, 89,91,93,94, 95,96, 97, or 99. 

The invention will be further described by the following examples, which are not 
intended to limit the scope of the invention in any manner. 

Examples 

Example 1 

Construction of maize-optimized genes for hyperthermophilic starch- 

processing/isomerization enzymes 

The enzymes, a-amylase, pullulanase, a-glucosidase, and glucose isomerase, involved in 
starch degradation or glucose isomerization were selected for their desired activity profiles. 
These include, for example, minimal activity at ambient temperature, high temperature 
activity/stability, and activity at low pH. The corresponding genes were then designed by using 
maize preferred codons as described in U.S. Patent No. 5,625,136 and synthesized by Integrated 
DNA Technologies, Inc. (Coralville, IA). 

The 797GL3 a-amylase, having the amino acid sequence SEQ ID NO:l, was selected for 
its hyperthermophilic activity. This enzyme's nucleic acid sequence was deduced and maize- 
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optimized as represented in SEQ ID NO:2. Similarly, the 6gp3 pullulanase was selected having 
the amino acid sequence set forth in SEQ ID NO:3. The nucleic acid sequence for the 6gp3 
pullulanase was deduced and maize-optimized as represented in SEQ ID NO:4. 

The amino acid sequence for malA a-glucosidase from Sulfolobus solfataricus was 
obtained from the literature, J. Bact. 177:482-485 (1995); J. Bact. 180:1287-1295 (1998), Based 
on the published amino acid sequence of the protein (SEQ ID NO:5), the maize-optimized 
synthetic gene (SEQ ID NO:6) encoding the malA a-glucosidase was designed. 

Several glucose isomerase enzymes were selected. The amino acid sequence (SEQ ID 
NO: 18) for glucose isomerase derived from Thermotoga maritima was predicted based on the 
published DNA sequence having Accession No. NC_000853 and a maize-optimized synthetic 
gene was designed (SEQ ED NO: 19). Similarly the amino acid sequence (SEQ ID NO:20) for 
glucose isomerase derived from Thermotoga neapolitana was predicted based on the published 
DNA sequence from Appl. Envir. Microbiol. 61(5): 1 867-1 875 (1995), Accession No. L38994. 
A maize-optimized synthetic gene encoding the Tfiermotoga neapolitana glucose isomerase was 
designed (SEQ ID NO:21). 

Example 2 

Expression of ftision of 797GL3 a-amvlase and starch encapsulating region in E. coli 
A construct encoding hyperthermophilic 797GL3 a-amylase fused to the starch 
encapsulating region (SER) from maize granule-bound starch synthase (waxy) was introduced 
and expressed in E. coli. The maize granule-bound starch synthase cDNA (SEQ ED NO:7) 
encoding the amino acid sequence (SEQ ID NO:8)(Klosgen RB, et al. 1986) was cloned as a 
source of a starch binding domain, or starch encapsulating region (SER). The full-length cDNA 
was amplified by RT-PCR from RNA prepared from maize seed using primers SV57 
(5'AGCGAATTCATGGCGGCTCTGGCCACGT 3') (SEQ ID NO: 22) and SV58 
(5 ' AGCT AAGCTTC AGGGCGCGGCCACGTTCT 3') (SEQ ID NO: 23) designed from 
GenBank Accession No. X03935. The complete cDNA was cloned into pBluescript as an 
EcoRJ/Hindlll fragment and the plasmid designated pNOV4022. 



75 



WO 2005/096804 



PCT/US2004/007182 



The C-terminal portion (encoded by bp 919-1818) of the waxy cDNA, including the 
starch-binding domain, was amplified from pNOV4022 and fused in-frame to the 3' end of the 
full-length maize-optimized 797GL3 gene (SEQ ID NO:2). The fused gene product, 
797GL3/Waxy, having the nucleic acid SEQ DD NO:9 and encoding the amino acid sequence, 
SEQ ID NO: 10, was cloned as an Ncol/Xbal fragment into pET28b (NOVAGEN, Madison, WI) 
that was cut with Ncol/Nhel. The 797GL3 gene alone was also cloned into the pET28b vector as 
an Ncol/Xbal fragment. 

The pET28/797GL3 and the pET28/797GL3AVaxy vectors were transformed into 
BL21/DE3 E. coli cells (NOVAGEN) and grown and induced according to the manufacturer's 
instruction. Analysis by PAGE/Coomassie staining revealed an induced protein in both extracts 
corresponding to the predicted sizes of the flised and unfiised amylase, respectively. 

Total cell extracts were analyzed for hyperthermophilic amylase activity as follows: 5 
mg of starch was suspended in 20 jal of water then diluted with 25 \x\ of ethanol. The standard 
amylase positive control or the sample to be tested for amylase activity was added to the mixture 
and water was added to a final reaction volume of 500 |il. The reaction was carried out at 80°C 
for 15-45 minutes. The reaction was then cooled down to room temperature, and 500 |il of o- 
dianisidine and glucose oxidase/peroxidase mixture (Sigma) was added. The mixture was 
incubated at 37°C for 30 minutes. 500 nl of 12 N sulfuric acid was added to stop the reaction. 
Absorbance at 540 nm was measured to quantitate the amount of glucose released by the 
amylase/sample. Assay of both the fused and unfused amylase extracts gave similar levels of 
hyperthermophilic amylase activity, whereas control extracts were negative. This indicated that 
the 797GL3 amylase was still active (at high temperatures) when fused to the C-terminal portion 
of the waxy protein. 

Example 3 

Isolation of promoter fragments for endosperm-specific expression in maize . 
The promoter and 5' noncoding region I (including the first intron) from the large subunit 
of Zea mays ADP-gpp (ADP-glucose pyrophosphorylase) was amplified as a 1515 base pair 
fragment (SEQ ID NO:l 1) from maize genomic DNA using primers designed from Genbank 
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accession M8I603. The ADP-gpp promoter has been shown to be endosperm-specific (Shaw 
and Hannah, 1992). 

The promoter from the Zea mays y-zein gene was amplified as a 673 bp fragment (SEQ 
ED NO: 12) from plasmid pGZ27.3 (obtained from Dr. Brian Larkins). The y-zein promoter has 
been shown to be endosperm-specific (Torrent et al. 1997), 

Example 4 

Construction of transformation vectors for the 797GL3 hvperthermophilic a-amvlase 

Expression cassettes were constructed to express the 797GL3 hyperthermophilic amylase 
in maize endosperm with various targeting signals as follows: 

pNOV6200 (SEQ ID NO: 1 3) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ED NO: 17) fused to the synthetic 797GL3 amylase as 
described above in Example 1 for targeting to the endoplasmic reticulum and secretion into the 
apoplast (Torrent et al. 1997). The fusion was cloned behind the maize ADP-gpp promoter for 
expression specifically in the endosperm. 

pNOV6201 (SEQ ID NO: 14) comprises the y-zein N-terminal signal sequence fused to 
the synthetic 797GL3 amylase with a C-terminal addition of the sequence SEKDEL for targeting 
to and retention in the endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was 
cloned behind the maize ADP-gpp promoter for expression specifically in the endosperm. 

pNOV7013 comprises the y-zein N-terminal signal sequence fused to the synthetic 
797GL3 amylase with a C-terminal addition of the sequence SEKDEL for targeting to and 
retention in the endoplasmic reticulum (ER). PNOV7013 is the same as pNOV6201, except that 
the the maize y- zein promoter (SEQ ID NO: 12) was used instead of the maize ADP-spp 
promoter in order to express the fusion in the endosperm. 

pNOV4029 (SEQ ID NO: 15) comprises the waxy amyloplast targeting peptide (KJosgen 
et al., 1986) fused to the synthetic 797GL3 amylase for targeting to the amyloplast. The fusion 
was cloned behind the maize ADP-gpp promoter for expression specifically in the endosperm. 
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pNOV4031 (SEQ TD NO: 16) comprises the waxy amyloplast targeting peptide fused to 
the synthetic 797GL3/waxy fusion protein for targeting to starch granules. The fusion was 
cloned behind the maize ADP-gpp promoter for expression specifically in the endosperm. 

Additional constructs were made with these fusions cloned behind the maize y-zein 
promoter to obtain higher levels of enzyme expression. All expression cassettes were moved 
into a binary vector for transformation into maize via Agrobacterium infection. The binary 
vector contained the phosphomannose isomerase (PMI) gene which allows for selection of 
transgenic cells with mannose. Transformed maize plants were either self-pollinated or 
outcrossed and seed was collected for analysis. 

Additional constructs were made with the targeting signals described above fused to 
either 6gp3 pullulanase or to 340gl2 oc-glucosidase in precisely the same manner as described 
for the a-amylase. These fusions were cloned behind the maize ADP-gpp promoter and/or the y 
zein promoter and transformed into maize as described above. Transformed maize plants were 
either self-pollinated or outcrossed and seed was collected for analysis. 

Combinations of the enzymes can be produced either by crossing plants expressing the 
individual enzymes or by cloning several expression cassettes into the same binary vector to 
enable cotransformation. 

Example 5 

Construction of plant transformation vectors for the 6GP3 thermophil lic pullulanase 
An expression cassette was constructed to express the 6GP3 thermophillic pullanase in 
the endoplasmic reticulum of maize endosperm as follows: 

pNOV7005 (SEQ ID NOs:24 and 25) comprises the maize y-zein N-terminal signal 
sequence fused to the synthetic 6GP3 pullulanase with a C-terminal addition of the sequence 
SEKDEL for targeting to and retention in the ER. The amino acid peptide SEKDEL was fused 
to the C-terminal end of the enzymes using PCR with primers designed to amplify the synthetic 
gene and simultaneously add the 6 amino acids at the C-terminal end of the protein. The fusion 
was cloned behind the maize y-zein promoter for expresson specifically in the endosperm. 

Example 6 

Construction of plant transformation vectors for the malA 
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hyperthermophilic a-glucosidase 

Expression cassettes were constructed to express the Sulfolobus solfataricas malA 
hyperthermophilic a-glucosidase in maize endosperm with various targeting signals as follows: 

pNOV4831 (SEQ ID NO:26) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ED NO: 17) fused to the synthetic malA a-glucosidase 
with a C-termina! addition of the sequence SEKDEL for targeting to and retention in the 
endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned behind the 
maize y-zein promoter for expresson specifically in the endosperm. 

pNOV4839 (SEQ ID NO:27) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic malA a-glucosidase for 
targeting to the endoplasmic reticulum and secretion into the apoplast (Torrent et al. 1997). The 
fusion was cloned behind the maize y-zein promoter for expression specifically in the 
endosperm. 

pNOV4837 comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic malA a-glucosidase 
with a C-terminal addition of the sequence SEKDEL for targeting to and retention in the ER. 
The fusion was cloned behind the maize ADPgpp promoter for expression specifically in the 
endosperm. The amino acid sequence for this clone is identical to that of pNOV483 1 (SEQ ID 
NO:26). 
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Example 7 

Construction of plant transformation vectors for the hvperthermophillic 
Thermotoga maritima and Thermotoga neapolitana glucose isomerases 

Expression cassettes were constructed to express the Thermotoga maritima and 
TJiermotoga neapolitana hyperthermophilic glucose isomerases in maize endosperm with 
various targeting signals as follows: 

pNOV4832 (SEQ DD NO:28) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ DD NO:l 7) fused to the synthetic Thermotoga maritima 
glucose isomerase with a C-terminal addition of the sequence SEKDEL for targeting to and 
retention in the ER. The fusion was cloned behind the maize y-zein promoter for expression 
specifically in the endosperm. 

pNOV4833 (SEQ DD NO:29) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic Thermotoga 
neapolitana glucose isomerase with a C-terminal addition of the sequence SEKDEL for targeting 
to and retention in the ER. The fusion was cloned behind the maize y-zein promoter for 
expression specifically in the endosperm. 

pNOV4840 (SEQ ID NO:30) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic Thermotoga 
neapolitana glucose isomerase for targeting to the endoplasmic reticulum and secretion into the 
apoplast (Torrent et al. 1 997). The fiision was cloned behind the maize y-zein promoter for 
expression specifically in the endosperm. 

pNOV4838 comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic Thermotoga neapolitana 
glucose isomerase with a C-terminal addition of the sequence SEKDEL for targeting to and 
retention in the ER. The fusion was cloned behind the maize ADPgpp promoter for expression 
specifically in the endosperm. The amino acid sequence for this clone is identical to that of 
pNOV4833 (SEQ ID NO:29). 
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Example 8 

Construction of plant transformation vectors for the 
expression of the hvperthermophillic glucanase EglA 
pNOV4800 (SEQ ID NO:58) comprises the barley alpha amylase AMY32b signal 
sequence (MGKNGNLCCFSLLLLLLAGLASGHQ)(SEQ ED NO:31) fused with the EglA 
mature protein sequence for localization to the apoplast. The fusion was cloned behind the 
maize y-zein promoter for expression specifically in the endosperm. 

Example 9 

Construction of plant transformation vectors for the 
expression of multiple hvperthermophillic enzymes 

pNOV4841 comprises a double gene construct of a 797GL3 a-amylase fusion and a 
6GP3 pullulanase fusion. Both 797GL3 fusion (SEQ ID NO:33) and 6GP3 fusion (SEQ ID 
NO:34) possessed the maize y-zein N-terminal signal sequence and SEKDEL sequence for 
targeting to and retention in the ER. Each fusion was cloned behind a separate maize y-zein 
promoter for expression specifically in the endosperm. 

pNOV4842 comprises a double gene construct of a 797GL3 a-amylase fusion and a 
malA a-glucosidase fusion. Both the 797GL3 fusion polypeptide (SEQ ID NO:35) and malA a- 
glucosidase fusion polypeptide (SEQ ID NO:36) possess the maize y-zein N-terminal signal 
sequence and SEKDEL sequence for targeting to and retention in the ER. Each fusion was 
cloned behind a separate maize y-zein promoter for expression specifically in the endosperm. 

pNOV4843 comprises a double gene construct of a 797GL3 a-amylase fusion and a 
malA a-glucosidase fusion. Both the 797GL3 fusion and malA a-glucosidase fusion possess the 
maize y-zein N-terminal signal sequence and SEKDEL sequence for targeting to and retention in 
the ER. The 797GL3 fusion was cloned behind the maize y-zein promoter and the malA fusion 
was cloned behind the maize ADPgpp promoter for expression specifically in the endosperm. 
The amino acid sequences of the 797GL3 fusion and the malA fusion are identical to those of 
pNOV4842 (SEQ ID Nos: 35 and 36, respectively). 

pNOV4844 comprises a triple gene construct of a 797GL3 a-amylase fusion, a 6GP3 

pullulanase fusion, and a malA a-glucosidase fusion. 797GL3, malA, and 6GP3 all possess the 
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maize y-zein N-terminal signal sequence and SEKDEL sequence for targeting to and retention in 
the ER. The 797GL3 and malA fusions were cloned behind 2 separate maize y-zein promoters, 
and the 6GP3 fusion was cloned behind the maize ADPgpp promoter for expression specifically 
in the endosperm. The amino acid sequences for the 797GL3 and malA fusions are identical to 
those of pNOV4842 (SEQ ID Nos: 35 and 36, respectively). The amino acid sequence for the 
6GP3 fusion is identical to that of the 6GP3 fusion in pNOV4841 (SEQ ID NO:34). 

All expression cassettes set forth in this Example as well as in the Examples that follow 
were moved into the binary vector pNOV21 17 for transformation into maize via Agrobaciehum 
infection. pNOV21 1 7 contains the phosphomannose isomerase (PMI) gene allowing for 
selection of transgenic cells with mannose. pNOV21 17 is a binary vector with both the pVSl 
and ColEl origins of replication. This vector contains the constitutive VirG gene from pAD1289 
(Hansen, G., et at., PNAS USA 91 :7603-7607 (1994), incorporated by reference herein) and a 
spectinomycin resistance gene from Tn7. Cloned into the polylinker between the right and left 
borders are the maize ubiquitin promoter, PMI coding region and nopaline synthase terminator of 
pNOVl 17 (Negrotto, D,, et al, Plant Cell Reports 19:798-803 (2000), incorporated by reference 
herein). Transformed maize plants will either be self-pollinated or outcrossed and seed collected 
for analysis. Combinations of the different enzymes can be produced either by crossing plants 
expressing the individual enzymes or by transforming a plant with one of the multi-gene 
cassettes. 

Example 10 

Construction of bacterial and Pichia expression vectors 
Expression cassettes were constructed to express the hyperthermophilic a-glucosidase 

and glucose isomerases in either Pichia or bacteria as follows: 

pNOV4829 (SEQ ID NOS: 37 and 38) comprises a synthetic Thermotoga maritima 

glucose isomerase fusion with ER retention signal in the bacterial expression vector pET29a. 

The glucose isomerase fusion gene was cloned into the Ncol and Sad sites of pET29a, which 

results in the addition of an N-terminal S-tag for protein purification. 

pNOV4830 (SEQ ID NOS: 39 and 40) comprises a synthetic Tfiermotoga neapolitana 

glucose isomerase fusion with ER retention signal in the bacterial expression vector pET29a. 
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The glucose isomerase fission gene was cloned into the Ncol and Sad sites of pET29a, which 
results in the addition of an N-terminal S-tag for protein purification. 

pNOV4835 (SEQ ID NO: 41 and 42) comprises the synthetic Thermotoga maritima 
glucose isomerase gene cloned into the BamHI and EcoRI sites of the bacterial expression vector 
pET28C. This resulted in the fusion of a His-tag (for protein purification) to the N-terminal end 

of the glucose isomerase. 

pNOV4836 (SEQ ID NO: 43 AND 44) comprises the synthetic Thermotoga neapolitana 
glucose isomerase gene cloned into the BamHI and EcoRI sites of the bacterial expression vector 
pET28C. This resulted in the fusion of a His-tag (for protein purification) to the N-terminal end 

of the glucose isomerase. 

Example 11 

Transformation of immature maize embryos was performed essentially as described in 
Negrotto et al., Plant Cell Reports 19: 798-803. For this example, all media constituents are as 
described in Negrotto et al., supra. However, various media constituents described in the 

literature may be substituted. 

A. Transformation plasmids and selectable marker 

The genes used for transformation were cloned into a vector suitable for maize 
transformation. Vectors used in this example contained the phosphomannose isomerase (PMI) 
gene for selection of transgenic lines (Negrotto et al. (2000) Plant Cell Reports 19: 798-803). 
B. Preparation of Azrobacterium tumefaciens 

Agrobacterium strain LBA4404 (pSBl) containing the plant transformation plasmid was 
grown on YEP (yeast extract (5 g/L), peptone (lOg/L), NaCl (5g/L),15g/l agar, pH 6.8) solid 
medium for 2 - 4 days at 28°C Approximately 0.8X 10 9 Agrobacterium were suspended in LS- 
inf media supplemented with 100 \\U As (Negrotto et a/.,(2000) Plant Cell Rep 19: 798-803). 
Bacteria were pre-induced in this medium for 30-60 minutes. 
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C Inoculation 

Immature embryos from A 188 or other suitable genotype were excised from 8 - 12 day old ears 
into liquid LS-inf + 100 As. Embryos were rinsed once with fresh infection medium. 
Agrobacterium solution was then added and embryos were vortexed for 30 seconds and allowed 
to settle with the bacteria for 5 minutes. The embryos were then transferred scutellum side up to 
LSAs medium and cultured in the dark for two to three days. Subsequently, between 20 and 25 
embryos per petri plate were transferred to LSDc medium supplemented with cefotaxime (250 
mg/1) and silver nitrate (1 .6 mg/1) and cultured in the dark for 28°C for 10 days. 

D. Selection of transformed cells and regeneration of transformed plants 
Immature embryos producing embryogenic callus were transferred to LSD1M0.5S medium. The 
cultures were selected on this medium for 6 weeks with a subculture step at 3 weeks. Surviving 
calli were transferred to Regl medium supplemented with mannose. Following culturing in the 
light (16 hour light/ 8 hour dark regiment), green tissues were then transferred to Reg2 medium 
without growth regulators and incubated for 1-2 weeks. Plantlets are transferred to Magenta 
GA-7 boxes (Magenta Corp, Chicago 111.) containing Reg3 medium and grown in the light. 
After 2-3 weeks, plants were tested for the presence of the PMI genes and other genes of interest 
by PCR. Positive plants from the PCR assay were transferred to the greenhouse. 

Example 12 

Analysis of Tl seed from maize plants expressing the 
q-amvlase targeted to apoplast or to the ER 
Tl seed from self-pollinated maize plants transformed with either pNOV6200 or 
pNOV6201 as described in Example 4 were obtained. Starch accumulation in these kernels 
appeared to be normal, based on visual inspection and on normal staining for starch with an 
iodine solution prior to any exposure to high temperature. Immature kernels were dissected and 
purified endosperms were placed individually in micro fiige tubes and immersed in 200 |il of 
50 mM NaP0 4 buffer. The tubes were placed in an 85°C water bath for 20 minutes, then cooled 
on ice. Twenty microliters of a 1% iodine solution was added to each tube and mixed. 
Approximately 25% of the segregating kernels stained normally for starch. The remaining 75% 
failed to stain, indicating that the starch had been degraded into low molecular weight sugars that 

84 



WO 2005/096804 



PCT/US2004/007182 



do not stain with iodine. It was found that the Tl kernels of pNOV6200 and pNOV620l were 
self-hydrolyzing the corn starch. There was no detectable reduction in starch following 
incubation at 37°C. 

Expression of the amylase was further analyzed by isolation of the hyperthermophilic 
protein fraction from the endosperm followed by PAGE/Coomassie staining. A segregating 
protein band of the appropriate molecular weight (50 kD) was observed. These samples are 
subjected to an a-amylase assay using commercially available dyed amylose (AMYLAZYME, 
from Megazyme, Ireland). High levels of hyperthermophilic amylase activity correlated with the 
presence of the 50 kD protein. 

It was further found that starch in kernels from a majority of transgenic maize, which 
express hyperthermophilic a-amylase, targeted to the amyloplast, is sufficiently active at ambient 
temperature to hydrolyze most of the starch if the enzyme is allowed to be in direct contact with 
a starch granule. Of the eighty lines having hyperthermophilic a -amylase targeted to the 
amyloplast, four lines were identified that accumulate starch in the kernels. Three of these lines 
were analyzed for thermostable a -amylase activity using a colorimetric amylazyme assay 
(Megazyme). The amylase enzyme assay indicated that these three lines had low levels of 
thermostable amylase activity. When purified starch from these three lines was treated with 
appropriate conditions of moisture and heat, the starch was hydrolyzed indicating the presence of 
adequate levels of a -amylase to facilitate the auto-hydrolysis of the starch prepared from these 

lines. 

Tl seed from multiple independent lines of both pNOV6200 and pNOV6201 
transformants was obtained. Individual kernels from each line were dissected and purified 
endosperms were homogenized individually in 300 ^1 of 50 mM NaPC>4 buffer. Aliquots of the 
endosperm suspensions were analyzed for a-amylase activity at 85°C. Approximately 80% of 
the lines segregate for hyperthermophilic activity (See Figures 1A, IB, and 2). 

Kernels from wild type plants or plants transformed with pNOV620l were heated at 
1 00°C for 1 , 2, 3, or 6 hours and then stained for starch with an iodine solution. Little or no 
starch was detected in mature kernels after 3 or 6 hours, respectively. Thus, starch in mature 
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kernels from transgenic maize which express hyperthermophilic amylase that is targeted to the 
endoplasmic reticulum was hydrolyzed when incubated at high temperature. 

In another experiment, partially purified starch from mature Tl kernels from pNOV6201 
plants that were steeped at 50°C for 16 hours was hydrolyzed after heating at 85°C for 5 minutes. 
This illustrated that the a-amylase targeted to the endoplasmic reticulum binds to starch after 
grinding of the kernel, and is able to hydrolyze the starch upon heating. Iodine staining indicated 
that the starch remains intact in mature seeds after the 16 hour steep at 50°C. 

In another experiment, segregating, mature kernels from plants transformed with 
pNOV6201 were heated at 95°C for 16 hours and then dried. In seeds expressing the 
hyperthermophilic a-amylase, the hydrolysis of starch to sugar resulted in a wrinkled appearance 
following drying. 

Example 13 

Analysis of Tl seed from maize plants expressing the 
q-amvlase targeted to the amvloplast 
Tl seed from self-pollinated maize plants transformed with either pNOV4029 or 
pNOV4031 as described in Example 4 was obtained. Starch accumulation in kernels from these 
lines was clearly not normal. All lines segregated, with some variation in severity, for a very low 
or no starch phenotype. Endosperm purified from immature kernels stained only weakly with 
iodine prior to exposure to high temperatures. After 20 minutes at 85°C, there was no staining. 
When the ears were dried, the kernels shriveled up. This particular amylase clearly had sufficient 
activity at greenhouse temperatures to hydrolyze starch if allowed to be in direct contact with the 
granule 

Example 14 

Fermentation of grain from maize plants expressing q-amvlase 
100% Transgenic grain 85°C vs. 95°C, varied liquefaction time. 
Transgenic com (pNOV6201) that contains a thermostable a-amylase performs well in 
fermentation without addition of exogenous a-amylase, requires much less time for liquefaction 
and results in more complete solubilization of starch. Laboratory scale fermentations were 
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performed by a protocol with the following steps (detailed below): 1) grinding, 2) moisture 
analysis, 3) preparation of a slurry containing ground corn, water, backset and cc-amylase, 4) 
liquefaction and 5) simultaneous saccharification and fermentation (SSF). In this example the 
temperature and time of the liquefaction step were varied as described below. In addition the 
transgenic corn was liquefied with and without exogenous ot-amylase and the performance in 
ethanol production compared to control corn treated with commercially available a-amylase. 

The transgenic corn used in this example was made in accordance with the procedures set 
out in Example 4 using a vector comprising the a-amylase gene and the PMI selectable marker, 
namely pNOV6201 . The transgenic corn was produced by pollinating a commercial hybrid 
(N3030BT) with pollen from a transgenic line expressing a high level of thermostable 
a-amylase. The corn was dried to 1 1% moisture and stored at room temperature. The <x- 
amylase content of the transgenic corn flour was 95 units/g where 1 unit of enzyme generates 1 
micromole reducing ends per min from corn flour at 85 °C in pH 6.0 MES buffer. The control 
corn that was used was a yellow dent corn known to perform well in ethanol production. 

1) Grinding: Transgenic corn (1 180 g) was ground in a Perten 3100 hammer mill 
equipped with a 2.0 mm screen thus generating transgenic com flour. Control corn was ground 
in the same mill after thoroughly cleaning to prevent contamination by the transgenic corn. 

2) Moisture analysis: Samples (20 g) of transgenic and control corn were weighed into 
aluminum weigh boats and heated at 100 C for 4 h. The samples were weighed again and the 
moisture content calculated from the weight loss. The moisture content of transgenic flour was 
9.26%; that of the control flour was 12.54%. 

3) Preparation of slurries: The composition of slurries was designed to yield a mash with 
36% solids at the beginning of SSF. Control samples were prepared in 100 ml plastic bottles and 
contained 21.50 g of control corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight), and 0.30 ml of a commercially available a-amylase diluted 1/50 with water. The a- 
amylase dose was chosen as representative of industrial usage. When assayed under the 
conditions described above for assay of the transgenic a-amylase, the control a-amylase dose 
was 2 U/g com flour. pH was adjusted to 6.0 by addition of ammonium hydroxide. Transgenic 
samples were prepared in the same fashion but contained 20 g of corn flour because of the lower 
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moisture content of transgenic flour Slurries of transgenic flour were prepared either with ot- 
amylase at the same dose as the control samples or without exogenous oc-amylase. 

4) Liquefaction: The bottles containing slurries of transgenic corn flour were immersed 
in water baths at either 85 °C or 95 °C for times of 5, 1 5, 30, 45 or 60 min. Control slurries were 
incubated for 60 min at 85 °C During the high temperature incubation the slurries were mixed 
vigorously by hand every 5 min. After the high temperature step the slurries were cooled on ice. 

5) Simultaneous saccharification and fermentation: The mash produced by liquefaction 
was mixed with glucoamylase (0.65 ml of a 1/50 dilution of a commercially available L-400 
glucoamylase), protease (0.60 ml of a 1,000-fold dilution of a commercially available protease), 
0.2 mg Lactocide & urea (0.85 ml of a 10- fold dilution of 50% Urea Liquor). A hole was cut 
into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The mash was then 
inoculated with yeast ( 1 .44 ml) and incubated in a water bath set at 90 F. After 24 hours of 
fermentation the temperature was lowered to 86 F; at 48 hours it was set to 82 F. 

Yeast for inoculation was propagated by preparing a mixture that contained yeast (0.12 g) 
with 70 grams maltodextrin, 230 ml water, 100 ml backset, glucoamylase (0.88 ml of a 10-fold 
dilution of a commercially available glucoamylase), protease (1.76 ml of a 100-fold dilution of a 
commercially available enzyme), urea (1 .07 grams), penicillin (0.67 mg) and zinc sulfate (0.13 
g). The propagation culture was initiated the day before it was needed and was incubated with 
mixing at 90°F. 

At 24, 48 & 72 hour samples were taken from each fermentation vessel, filtered through 
0.2 urn filters and analyzed by HPLC for ethanol & sugars. At 72 h samples were analyzed for 
total dissolved solids and for residual starch. 

HPLC analysis was performed on a binary gradient system equipped with refractive 
index detector, column heater & Bio-Rad Aminex HPX-87H column. The system was 
equilibrated with 0.005 M H 2 S0 4 in water at 1 ml/min. Column temperature was 50 °C. Sample 
injection volume was 5 pi; elution was in the same solvent. The RI response was calibrated by 
injection of known standards. Ethanol and glucose were both measured in each injection. 

Residual starch was measured as follows. Samples and standards were dried at 50°C in 
an oven, then ground to a powder in a sample mill. The powder (0.2 g) was weighed into a 15 
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ml graduated centrifuge tube. The powder was washed 3 times with 10 ml aqueous ethanol 
(80% v/v) by vortexing followed by centrifiigation and discarding of the supernatant. DMSO 
(2.0 ml) was added to the pellet followed by 3.0 ml of a thermostable alpha-amylase (300 units) 
in MOPS buffer. After vigorous mixing, the tubes were incubated in a water bath at 85°C for 60 
min. During the incubation, the tubes were mixed four times. The samples were cooled and 4.0 
ml sodium acetate buffer (200 mM, pH 4.5) was added followed by 0. 1 ml of glucoamylase (20 
U). Samples were incubated at 50°C for 2 hours, mixed, then centrifuged for 5 min at 3,500 
rpm. The supernatant was filtered through a 0.2 urn filter and analyzed for glucose by the HPLC 
method described above. An injection size of 50 |il was used for samples with low residual 
starch (<20% of solids). 

Results Transgenic corn performed well in fermentation without added a-amylase. The 
yield of ethanol at 72 hours was essentially the same with or without exogenous a-amylase as 
shown in Table I. These data also show that a higher yield of ethanol is achieved when the 
liquefaction temperature is higher; the present enzyme expressed in the transgenic com has 
activity at higher temperatures than other enzymes used commercially such as the Bacillus 
liquefaciens a-amylase. 
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Table I 



■ ■ 

Liquefaction 


Liquefaction 


Exogenous a- 


# replicates 


Mean 


Std. Dev. 


temp 


time 


amylase 




Ethanol % 


% v/v 


°C 


min. 






v/v 




85 


60 


Yes 


4 


17.53 


0.18 


85 


60 


No 


4 


17.78 


0.27 


95 


60 


Yes 


2 


18.22 


ND 


95 


60 


No 


2 


18.25 


ND 



When the liquefaction time was varied, it was found that the liquefaction time required for 
efficient ethanol production was much less than the hour required by the conventional process. 
Figure 3 shows that the ethanol yield at 72 hours fermentation was almost unchanged from 15 
min to 60 min liquefaction. In addition liquefaction at 95°C gave more ethanol at each time 
point than at the 85°C liquefaction. This observation demonstrates the process improvement 
achieved by use of a hyperthermophilic enzyme. 

The control corn gave a higher final ethanol yield than the transgenic corn, but the control 
was chosen because it performs very well in fermentation. In contrast the transgenic corn has a 
genetic background chosen to facilitate transformation. Introducing the a-amylase-trait into elite 
corn germplasm by well-known breeding techniques should eliminate this difference. 

Examination of the residual starch levels of the beer produced at 72 hours (Figure 4) 
shows that the transgenic ot-amylase results in significant improvement in making starch 
available for fermentation; much less starch was left over after fermentation. 

Using both ethanol levels and residual starch levels the optimal liquefaction times were 
1 5 min at 95°C and 30 min at 85°C. In the present experiments these times were the total time 
that the fermentation vessels were in the water bath and thus include a time period during which 
the temperature of the samples was increasing from room temperature to 85 Q C or 95°C. Shorter 
liquefaction times may be optimal in large scale industrial processes that rapidly heat the mash 
by use of equipment such as jet cookers. Conventional industrial liquefaction processes require 
holding tanks to allow the mash to be incubated at high temperature for one or more hours. The 
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present invention eliminates the need for such holding tanks and will increase the productivity of 
liquefaction equipment. 

One important function of a-amylase in fermentation processes is to reduce the viscosity 
of the mash. At all time points the samples containing transgenic corn flour were markedly less 
viscous than the control sample. In addition the transgenic samples did not appear to go through 
the gelatinous phase observed with all control samples; gelatinization normally occurs when 
corn slurries are cooked. Thus having the a-amylase distributed throughout the fragments of the 
endosperm gives advantageous physical properties to the mash during cooking by preventing 
formation of large gels that slow diffusion and increase the energy costs of mixing and pumping 
the mash. 

The high dose of a-amylase in the transgenic corn may also contribute to the favorable 
properties of the transgenic mash. At 85°C, the a-amylase activity of the transgenic corn was 
many times greater activity than the of the dose of exogenous a-amylase used in controls. The 
latter was chosen as representative of commercial use rates. 

Example IS 

Effective function of transgenic corn when mixed with control corn 
Transgenic corn flour was mixed with control corn flour in various levels from 5% to 
100% transgenic corn flour. These were treated as described in Example 14. The mashes 
containing transgenically expressed a-amylase were liquefied at 85 °C for 30 min or at 95 °C for 
15 min; control mashes were prepared as described in Example 14 and were liquefied at 85 °C 
for 30 or 60 min (one each) or at 95 °C for 1 5 or 60 min (one each). 

The data for ethanol at 48 and 72 hours and for residual starch are given in Table 2. The 
ethanol levels at 48 hours are graphed in Figure 5; the residual starch determinations are shown 
in Figure 6. These data show that transgenically expressed thermostable a-amylase gives very 
good performance in ethanol production even when the transgenic grain is only a small portion 
(as low as 5%) of the total grain in the mash. The data also show that residual starch is markedly 
lower than in control mash when the transgenic grain comprises at least 40% of the total grain. 
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Table 2 





85 °C Liquefaction 


95 °C Liquefaction 


Transgenic 
grain 
wt% 


D pciHiifll 
rvvoiu Wai 

Starch 


Fthanol 

48 h 


Fthanol 

-L^ \ > I via IV/ 1 

% v/v 
72 h 


Residual 
Starch 


Fthanol 
48 h 


Ethanol 
% v/v 
72 h 


100 


3 58 


16 71 


18 32 


4.19 


17.72 


21.14 


80 


4.06 


17.04 


19.2 


3.15 


17.42 


19.45 


60 


3.86 


17.16 


19.67 


4.81 


17.58 


19.57 


40 


5.14 


17.28 


19.83 


8.69 


17.56 


19.51 


20 


8.77 


17.11 


19.5 


11.05 


17.71 


19.36 


10 


10.03 


18.05 


19.76 


10.8 


17.83 


19.28 


5 


10.67 


18.08 


19.41 


12.44 


17.61 


19.38 


0* 


7.79 


17.64 


20.11 


11.23 


17.88 


19.87 



* Control samples . Values the average of 2 determinations 



Example 16 

Ethanol production as a function of liquefaction pH using 
transgenic corn at a rate of 1.5 to 12 % of total corn 
Because the transgenic com performed well at a level of 5-10% of total corn in a 
fermentation, an additional series of fermentations in which the transgenic corn comprised 1.5 to 
1 2% of the total com was performed. The pH was varied from 6.4 to 5.2 and the a-amylase 
enzyme expressed in the transgenic corn was optimized for activity at lower pH than is 
conventionally used industrially. 

The experiments were performed as described in Example 15 with the following 

exceptions: 

1 ) . Transgenic flour was mixed with control flour as a percent of total dry weight at the levels 
ranging from 1.5% to 12.0%. 

2) . Control com was N3O30BT which is more similar to the transgenic corn than the control 
used in examples 14 and 15. 

3) . No exogenous a-amylase was added to samples containing transgenic flour. 
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4) . Samples were adjusted to pH 5.2, 5.6, 6.0 or 6.4 prior to liquefaction. At least 5 samples 
spanning the range from 0% transgenic corn flour to 12% transgenic corn flour were prepared for 

each pH. 

5) . Liquefaction for all samples was performed at 85 °C for 60 min. 

The change in ethanol content as a function of fermentation time are shown in Figure 7. 
This figure shows the data obtained from samples that contained 3% transgenic com. At the 
lower pH, the fermentation proceeds more quickly than at pH 6.0 and above; similar behavior 
was observed in samples with other doses of transgenic grain. The pH profile of activity of the 
transgenic enzyme combined with the high levels of expression will allow lower pH 
liquefactions resulting in more rapid fermentations and thus higher throughput than is possible at 
the conventional pH 6.0 process. 

The ethanol yields at 72 hours are shown in Figure 8. As can be seen, on the basis of 
ethanol yield, the results showed little dependence on the amount of transgenic grain included in 
the sample. Thus the grain contains abundant amylase to facilitate fermentative production of 
ethanol. It is also demonstrates that lower pH of liquefaction results in higher ethanol yield. 

The viscosity of the samples after liquefaction was monitored and it was observed that at 
pH 6.0, 6% transgenic grain is sufficient for adequate reduction in viscosity. At pH 5.2 and 5.6, 
viscosity is equivalent to that of the control at 12% transgenic grain, but not at lower percentages 
of transgenic grain. 

Example 17 

Production of fructose from com flour using thermophilic enzymes 

Corn that expresses the hyperthermophilic a-amylase, 797GL3, was shown to facilitate 
production of fructose when mixed with an a-glucosidase (MalA) and a xylose isomerase 

(XylA). 

Seed from pNOV6201 transgenic plants expressing 797GL3 were ground to a flour in a 
Kleco cell thus creating amylase flour. Non-transgenic corn kernels were ground in the same 
manner to generate control flour. 

The a-glucosidase, MalA (from S. solfataricus), was expressed in E. coli. Harvested 
bacteria were suspended in 50 mM potassium phosphate buffer pH 7.0 containing 1 mM 4-(2- 
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aminoethyl)benzenesuIfonyl fluoride then lysed in a French pressure cell. The lysate was 
centrifiiged at 23,000 x g for 15 min at 4°C. The supernatant solution was removed, heated to 
70° C for 10 min, cooled on ice for 10 min, then centrifuged at 34,000 x g for 30 min at 4°C. The 
supernatant solution was removed and the Mai A concentrated two-fold in centricon 10 devices. 
The filtrate of the centricon 10 step was retained for use as a negative control for MalA. 

Xylose (glucose) isomerase was prepared by expressing the xylA gene of T. neapolitana 
in E. coli. Bacteria were suspended in 100 mM sodium phosphate pH 7.0 and lysed by passage 
through a French pressure cell. After precipitation of cell debris, the extract was heated at 80° C 
for 10 min then centrifuged. The supernatant solution contained the XylA enzymatic activity. 
An empty-vector control extract was prepared in parallel with the XylA extract. 

Com flour (60 mg per sample) was mixed with buffer and extracts from E coli. As 
indicated in Table 3, samples contained amylase corn flour (amylase) or control corn flour 
(control), 50 \x\ of either MalA extract (+) or filtrate (-), and 20 \x\ of either XylA extract (+) or 
empty vector control (-). All samples also contained 230 ^il of 50mM MOPS, lOmM MgS04, 
and 1 mM CoC12; pH of the buffer was 7.0 at room temperature. 

Samples were incubated at 85°C for 18 hours. At the end of the incubation time, samples 
were diluted with 0.9 ml of 85°C water and centrifuged to remove insoluble material. The 
supernatant fraction was then filtered through a Centricon3 ultrafiltration device and analyzed by 
HPLC with ELSD detection. 

The gradient HPLC system was equipped with Astec Polymer Amino Column, 5 micron 
particle size, 250 X 4.6 mm and an Alltech ELSD 2000 detector. The system was pre- 
equilibrated with a 15:85 mixture of watenacetonitrile. The flow rate was 1 ml/min. The initial 
conditions were maintained for 5 min after injection followed by a 20 min gradient to 50:50 
watenacetonitrile followed by 10 minutes of the same solvent. The system was washed with 20 
min of 80:20 wateracetonitrile and then re-equilibrated with the starting solvent. Fructose was 
eluted at 5.8 min and glucose at 8.7 min. 
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Table 3 











fructose 


glucose 


Sample 


Corn flour 


MalA 


XylA 


peak area x 10 


peak area x 1 0" 6 


1 


amylase 


+ 


+ 


25.9 


110.3 


2 


amylase 


- 


+ 


7.0 


12.4 


3 


amylase 


+ 




0.1 


147.5 


4 


amylase 






0 


25.9 


5 


control 


+ 


+ 


0.8 


0.5 


6 


control 




+ 


0.3 


0.2 


7 


control 


+ 




1.3 


1.7 


8 


control 






0.2 


0.3 



The HPLC results also indicated the presence of larger maltooligosaccharides in all 
samples containing the a-amylase. These results demonstrate that the three thermophilic 
enzymes can function together to produce fructose from corn flour at a high temperature. 

Example 18 
Amylase Flour with Isomerase 

In another example, amylase flour was mixed with purified MalA and each of 
hvobacterial xylose isomerases: XylA of T. maritima, and an enzyme designated 
BD8037obtained from Diversa. Amylase flour was prepared as described in Example 18. 

S. solfataricus MalA with a 6His purification tag was expressed in £. coli. Cell lysate was 
prepared as described in Example 18, then purified to apparent homogeneity using a nickel 
affinity resin (Probond, Invitrogen) and following the manufacturer's instructions for native 
protein purification. 

T. maritime* XylA with the addition of an S tag and an ER retention signal was expressed 
in E. coli and prepared in the same manner as the T. neapolitana XylA described in Example 18. 

Xylose isomerase BD8037 was obtained as a lyophilized powder and resuspended in 
0.4x the original volume of water. 
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Amylase com flour was mixed with enzyme solutions plus water or buffer. All reactions 
contained 60 mg amylase flour and a total of 600nl of liquid. One set of reactions was buffered 
with 50 mM MOPS, pH 7.0 at room temperature, plus lOmM MgS04 and 1 mM CoCI2; in a 
second set of reactions the metal-containing buffer solution was replaced by water. Isomerase 
enzyme amounts were varied as indicated in Table 4. All reactions were incubated for 2 hours at 
90°C. Reaction supernatant fractions were prepared by centrifugation. The pellets were washed 
with an additional 600|il H 2 0 and recentrifuged. The supernatant fractions from each reaction 
were combined, filtered through a Centricon 10, and analyzed by HPLC with ELSD detection as 
described in Example 17. The amounts of glucose and fructose observed are graphed in Figure 
15. 



Table 4 



Sample 


Amylase flour 


Mai A 


Isomerase 


1 


60 mg 


+ 


none 


2 


60 mg 


+ 


T, maritime 100|il 


3 


60 mg 


+ 


T. maritima, 10 |il 


4 


60 mg 


+ 


T. mahtima, 2\x\ 


5 


60 mg 




BD8037, 100|il 


7 


60mg 


+ 


BD8037, 2^1 


C 


60 mg 


none 


none 



With each of the isomerases, fructose was produced from com flour in a dose-dependent 
manner when a-amylase and a-glucosidase were present in the reaction. These results 
demonstrate that the grain-expressed amylase 797GL3 can function with MalA and a variety of 
different thermophilic isomerases, with or without added metal ions, to produce fructose from 
corn flour at a high temperature. In the presence of added divalent metal ions, the isomerases 
can achieve the predicted fructose: glucose equilibrium at 90°C of approximately 55% fructose. 
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This would be an improvement over the current process using mesophilic isomerases, which 
requires a chromatographic separation to increase the fructose concentration. 

Example 19 
Expression of a pullulanase in corn 

Transgenic plants that were homozygous for either pNOV7013 or pNOV7005 were 
crossed to generate transgenic corn seed expressing both the 797GL3 a-amylase and 6GP3 
pullulanase. 

Tl or T2 seed from self-pollinated maize plants transformed with either pNOV 7005 or 
pNOV 4093 were obtained. pNOV4093 is a fusion of the maize optimized synthetic gene for 
6GP3 (SEQ ID: 3,4) with the amyloplast targeting sequence (SEQ ID NO: 7,8) for localization 
of the fusion protein to the amyloplast. This fusion protein is under the control of the ADPgpp 
promoter (SEQ ED NO: 11 ) for expression specifically in the endosperm. The pNOV7005 
construct targets the expression of the pullulanase in the endoplasmic reticulum of the 
endosperm. Localization of this enzyme in the ER allows normal accumulation of the starch in 
the kernels. Normal staining for starch with an iodine solution was also observed, prior to any 
exposure to high temperature. 

As described in the case of a-amylase the expression of pullulanase targeted to the 
amyloplast (pNOV4093) resulted in abnormal starch accumulation in the kernels. When the 
corn-ears are dried, the kernels shriveled up. Apparently, this thermophilic pullulanase is 
sufficiently active at low temperatures and hydrolyzes starch if allowed to be in direct contact 
with the starch granules in the seed endosperm. 

Enzyme preparation or extraction of the enzvme from corn-flour : The pullulanase 
enzyme was extracted from the transgenic seeds by grinding them in Kleco grinder, followed by 
incubation of the flour in 50mM NaOAc pH 5.5 buffer for 1 hr at RT, with continuous shaking. 
The incubated mixture was then spun for 1 5min. at 14000 rpm. The supernatant was used as 
enzyme source. 

Pullulanase assay : The assay reaction was carried out in 96-well plate. The enzyme 
extracted from the com flour (100 ill) was diluted 10 fold with 900 fi\ of 50mM NaOAc pH5.5 
buffer, containing 40 mM CaCI 2 . The mixture was vortexed, 1 tablet of Limit-Dextrizyme 
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(azurine-crosslinked-pullulan, from Megazyme) was added to each reaction mixture and 
incubated at 75 °C for 30 min (or as mentioned). At the end of the incubation the reaction 
mixtures were spun at 3500 rpm for 15 min. The supernatants were diluted 5 fold and 
transferred into 96-well flat bottom plate for absorbance measurement at 590 nm. Hydrolysis of 
azurine-crosslinked-pullulan substrate by the pullulanase produces water-soluble dye fragments 
and the rate of release of these (measured as the increase in absorbance at 590 nm) is related 
directly to enzyme activity. 

Figure 9 shows the analysis of T2 seeds from different events transformed with pNOV 
7005. High expression of pullulanase activity, compared to the non-transgenic control, can be 
detected in a number of events. 

To a measured amount (-100 /ig) of dry corn flour from transgenic (expressing 
pullulanase, or amylase or both the enzymes) and / or control (non-transgenic) 1000 /xl of 50 mM 
NaOAc pH 5.5 buffer containing 40 mM CaCl 2 was added. The reaction mixtures were vortexed 
and incubated on a shaker for 1 hr. The enzymatic reaction was started by transferring the 
incubation mixtures to high temperature (75 °C, the optimum reaction temperature for 
pullulanase or as mentioned in the figures) for a period of time as indicated in the figures. The 
reactions were stopped by cooling them down on ice. The reaction mixtures were then 
centrifuged for 10 min. at 14000 rpm. An aliquot (100 pi) of the supernatant was diluted three 
fold, filtered through 0.2-micron filter for HPLC analysis. 

The samples were analyzed by HPLC using the following conditions: 

Column: Alltech Prevail Carbohydrate ES 5 micron 250 X 4.6 mm 

Detector: Alltech ELSD 2000 

Pump: Gilson 322 

Injector: Gilson 215 injector/diluter 

Solvents: HPLC grade Acetonitrile (Fisher Scientific) and Water (purified by Waters 
Millipore System) 
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Gradient used for oligosaccharides of low degree of polymerization (DP 1-15). 

Time %Water %Acetonitrile 

0 15 85 

5 15 85 

25 50 50 

35 50 50 

36 80 20 

55 80 20 

56 15 85 
76 15 85 

Gradient used for saccharides of high degree of polymerization (DP 20- 100 and above). 

Time %Water %Acetonitrile 

0 35 65 

60 85 15 

70 85 15 

85 35 65 

100 35 65 

System used for data analysis: Gilson Unipoint Software System Version 3.2 

Figures 10A and 10B show the HPLC analysis of the hydrolytic products generated by 
expressed pullulanase from starch in the transgenic com flour Incubation of the flour of 
pullulanase expressing corn in reaction buffer at 75 <>C for 30 minutes results in production of 
medium chain oligosaccharides (DP -10-30) and short amylose chains (DP ~ 100 -200) from 
cornstarch. This figure also shows the dependence of pullulanase activity on presence of 
calcium ions. 

Transgenic com expressing pullulanase can be used to produce modified-starch/dextrin 
that is debranched (al-6 linkages cleaved) and hence will have high level of amylose/straight 
chain dextrin. Also depending on the kind of starch (e.g. waxy, high amylose etc.) used the 
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chain length distribution of the amylose/dextrin generated by the pullulanase will vary, and so 
will the property of the modified-starch/dextrin. 

Hydrolysis of a 1-6 linkage was also demonstrated using pullulan as the substrate. The 
pullulanase isolated from corn flour efficiently hydrolyzed pullulan. HPLC analysis (as 
described) of the product generated at the end of incubation showed production of maltotriose, as 
expected, due to the hydrolysis of the a 1-6 linkages in the pullulan molecules by the enzyme 
from the corn. 

Example 20 

Expression of pullulanase in corn 

Expression of the 6gp3 pullulanase was further analyzed by extraction from corn flour 
followed by PAGE and Coomassie staining. Corn-flour was made by grinding seeds, for 30 sec, 
in the Kleco grinder. The enzyme was extracted from about I50mg of flour with 1ml of 50mM 
NaOAc pH 5.5 buffer. The mixture was vortexed and incubated on a shaker at RT for lhr, 
followed by another 1 5 min incubation at 70 °C. The mixture was then spun down (14000 rpm 
for 1 5 min at RT) and the supernatant was used as SDS-PAGE analysis. A protein band of the 
appropriate molecular weight (95 kDal) was observed. These samples are subjected to a 
pullulanase assay using commercially available dye-conjugated limit-dextrins (LEMIT- 
DEXTRLZYME, from Megazyme, Ireland). High levels of thermophilic pullulanase activity 
correlated with the presence of the 95 kD protein. 

The Western blot and ELISA analysis of the transgenic corn seed also demonstrated the 
expression of -95 kD protein that reacted with antibody produced against the pullulanase 
(expressed in E. coli). 

Example 21 

Increase in the rate of starch hydrolysis and improved yield 

of small chain (fermentable) oligosaccharides 

by the addition of pullulanase expressing corn 

The data shown in Figures 1 1 A and 1 IB was generated from HPLC analysis, as 

described above, of the starch hydrolysis products from two reaction mixtures. The first reaction 

indicated as 'Amylase 1 contains a mixture [1:1 (w/w)] of corn flour samples of a -amylase 

expressing transgenic corn made according to the method described in Example 4, for example, 
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and non-transgenic corn A 188; and the second reaction mixture * Amylase + Pullulanase 1 
contains a mixture [1:1 (w/w)] of com flour samples of a -amylase expressing transgenic com 
and pullulanase expressing transgenic com made according to the method described in Example 
19. The results obtained support the benefit of use of pullulanase in combination with a-amylase 
during the starch hydrolysis processes. The benefits are from the increased rate of starch 
hydrolysis (Figure 1 1 A) and increase yield of fermentable oligosaccharides with low DP (Figure 
I IB). 

It was found that oamylase alone or a-amylase and pullulanase (or any other 
combination of starch hydrolytic enzymes) expressed in corn can be used to produce 
maltodextrin (straight or branched oligosaccharides) (Figures 1 1 A, 1 IB, 12, and 13A). 
Depending on the reaction conditions, the type of hydrolytic enzymes and their combinations, 
and the type of starch used the composition of the maltodextrins produced, and hence their 
properties, will vary. 

Figure 12 depicts the results of an experiment carried out in a similar manner as 
described for Figure 1 1 . The different temperature and time schemes followed during incubation 
of the reactions are indicated in the figure. The optimum reaction temperature for pullulanase is 
75 °C and for a -amylase it is >95 °C. Hence, the indicated schemes were followed to provide 
scope to carry out catalysis by the pullulanase and/or the a -amylase at their respective optimum 
reaction temperature. It can be clearly deduced from the result shown that combination of a- 
amylase and pullulanase performed better in hydrolyzing cornstarch at the end of 60 min 
incubation period. 

KPLC analysis, as described above (except -1 50 mg of corn flour was used in these 
reactions), of the starch hydrolysis product from two sets of reaction mixtures at the end of 30 
min incubation is shown in Figure 13A and 13B. The first set of reactions was incubated at 85 °C 
and the second one was incubated at 95 °C. For each set there are two reaction mixtures; the first 
reaction indicated as '.Amylase X Pullulanase' contains flour from transgenic corn (generated by 
cross pollination) expressing both the a-amylase and the pullulanase, and the second reaction 
indicated as ' Amylase* mixture of com flour samples of a -amylase expressing transgenic com 
and non-transgenic corn Al 88 in a ratio so as to obtain same amount of a -amylase activity as is 
observed in the cross (Amylase X Pullulanase). The total yield of low DP oligosaccharides was 
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more in case of a-amylase and pullulanase cross compared to com expressing a -amylase alone, 
when the corn flour samples were incubated at 85 °C. The incubation temperature of 95 °C 
inactivates (at least partially) the pullulanase enzyme, hence little difference can be observed 
between * Amylase X Pullulanase 7 and 'Amylase'. However, the data for both the incubation 
temperatures shows significant improvement in the amount of glucose produced (Figure 13B), at 
the end of the incubation period, when corn flour of a -amylase and pullulanase cross was used 
compared to corn expressing a -amylase alone. Hence use of corn expressing both a -amylase 
and pullulanase can be especially beneficial for the processes where complete hydrolysis of 
starch to glucose is important. 

The above examples provide ample support that pullulanase expressed in com seeds, 
when used in combination with a-amylase, improves the starch hydrolysis process. Pullulanase 
enzyme activity, being a 1-6 linkage specific, debranches starch far more efficiently than a - 
amylase (an a -1-4 linkage specific enzyme) thereby reducing the amount of branched 
oligosaccharides (e.g. limit-dextrin, panose; these are usually non-fermentable) and increasing 
the amount of straight chain short oligosaccharides (easily fermentable to ethanol etc.). 
Secondly, fragmentation of starch molecules by pullulanase catalyzed debranching increases 
substrate accessibility for the a-amylase, hence an increase in the efficiency of the a -amylase 
catalyzed reaction results. 

Example 22 

To determine whether the 797GL3 alpha amylase and malA alpha-glucosidase could 
function under similar pH and temperature conditions to generate an increased amount of 
glucose over that produced by either enzyme alone, approximately 0.35 ug of malA alpha 
glucosidase enzyme (produced in bacteria) was added to a solution containing 1% starch and 
starch purified from either non-transgenic com seed (control) or 797GL3 transgenic com seed (in 
797GL3 com seed the alpha amylase co-purifies with the starch). In addition, the purified starch 
from non-transgenic and 797GL3 transgenic com seed was added to 1% com starch in the 
absence of any malA enzyme. The mixtures were incubated at 90°C, pH 6.0 for 1 hour, spun 
down to remove any insoluble material, and the soluble fraction was analyzed by HPLC for 
glucose levels. As shown in Figure 14, the 797GL3 alpha-amylase and malA alpha-glucosidase 
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function at a similar pH and temperature to break down starch into glucose. The amount of 
glucose generated is significantly higher than that produced by either enzyme alone. 

Example 23 

The utility of the Thermoanaerobacterium glucoamylase for raw starch hydrolysis was 
determined. As set forth in Figure 15, the hydrolysis conversion of raw starch was tested with 
water, barley a-amylase (commercial preparation from Sigma), Thermoanaerobacterum 
glucoamylase, and combinations thereof were ascertained at room temperature and at 30°C. As 
shown, the combination of the barley a-amylase with the Thermoanaerobacterium glucoamylase 
was able to hydrolyze raw starch into glucose. Moreover, the amount of glucose produced by the 
barley amylase and thermoanaerobacter GA is significantly higher than that produced by either 
enzyme alone. 

Example 24 

Maize-optimized genes and sequences for raw-starch hydrolysis and 

vectors for plant transformation 
The enzymes were selected based on their ability to hydrolyze raw-starch at temperatures 
ranging from approximately 20°-50°C. The corresponding genes or gene fragments were then 
designed by using maize preferred codons for the construction of synthetic genes as set forth in 
Example 1. 

Aspergillus shirousami a-amylase/glucoamylase fusion polypeptide (without signal 
sequence) was selected and has the amino acid sequence as set forth in SEQ ID NO: 45 as 
identified in Biosci. Biotech. Biochem., 56:884-889 (1992); Agric. Biol. Chem. 545:1905-14 
(1990); Biosci. Biotechnol. Biochem. 56:174-79 (1992). The maize-optimized nucleic acid was 
designed and is represented in SEQ ID NO:46. 

Similarly, Thermoanaerobacterium thermosaccharolyticum glucoamylase was selected, 
having the amino acid of SEQ ID NO:47 as published in Biosci. Biotech. Biochem., 62:302-308 
(1998), was selected. The maize-optimized nucleic acid was designed (SEQ ID NO: 48). 
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Rhizopus oryzae glucoamylase was selected having the amino acid sequence (without 
signal sequence)(SEQ ID NO: 50), as described in the literature (Agric. Biol. Chem. (1986) 50, 
pg 957-964). The maize-optimized nucleic acid was designed and is represented in SEQ ID 

NO:51. 

Moreover, the maize a-amylase was selected and the amino acid sequence (SEQ ID NO: 
51) and nucleic acid sequence (SEQ ID NO:52) were obtained from the literature. See, e.g., 
Plant Physiol. 105:759-760(1994), 

Expression cassettes are constructed to express the Aspergillus shirousami ct- 
amylase/glucoamylase fusion polypeptide from the maize-optimized nucleic acid was designed 
as represented in SEQ ID NO:46, the Thermoanaerobacterium thermosaccharolyticum 
glucoamylase from the maize-optimized nucleic acid was designed as represented in SEQ ID 
NO: 48, the Rhizopus oryzae glucoamylase was selected having the amino acid sequence 
(without signal sequence)(SEQ ID NO: 49) from the maize-optimized nucleic acid was designed 
and is represented in SEQ ID NO:50, and the maize a-amylase. 

A plasmid comprising the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) is ftised to the synthetic gene encoding the 
enzyme. Optionally, the sequence SEKDEL is ftised to the C-terminal of the synthetic gene for 
targeting to and retention in the ER. The ftjsion is cloned behined the maize y-zein promoter for 
expression specifically in the endosperm in a plant transformation plasmid. The fusion is 
delivered to the corn tissue via Agrobacterium transfection. 

Example 25 

Expression cassettes comprising the selected enzymes are constructed to express the 
enzymes. A plasmid comprising the sequence for a raw starch binding site is ftised to the 
synthetic gene encoding the enzyme. The raw starch binding site allows the enzyme fusion to 
bind to non-gelatinized starch. The raw-starch binding site amino acid sequence (SEQ ED 
NO:53) was determined based on literature, and the nucleic acid sequence was maize-optimized 
to give SEQ ID NO:54. The maize-optimized nucleic acid sequence is fused to the synthetic 
gene encoding the enzyme in a plasmid for expression in a plant. 
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Example 26 

Construction of maize-optimized penes and vectors for plant transformation 
The genes or gene fragments were designed by using maize preferred codons for the 

construction of synthetic genes as set forth in Example 1. 

Pyrococcus furiosus EGLA, hyperthermophiiic endoglucanase amino acid sequence 

(without signal sequence) was selected and has the amino acid sequence as set forth in SEQ ID 

NO: 55, as identified in Journal of Bacteriology (1 999) 1 8 1 , pg 284-290.) The maize-optimized 

nucleic acid was designed and is represented in SEQ ID NO:56, 

Thermus flaws xylose isomerase was selected and has the amino acid sequence as set 

forth in SEQ ID NO:57, as described in Applied Biochemistry and Biotechnology 62:15-27 

(1997). 

Expression cassettes are constructed to express the Pyrococcus furiosus EGLA 
(endoglucanase) from the maize-optimized nucleic acid (SEQ ID NO:56) and the Thermus flavus 
xylose isomerase from a maize-optimized nucleic acid encoding amino acid sequence SEQ ID 
NO: 57 A plasmid comprising the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) is ftised to the synthetic maize-optimized 
gene encoding the enzyme. Optionally, the sequence SEKDEL is fused to the C-terminal of the 
synthetic gene for targeting to and retention in the ER. The fusion is cloned behined the maize y- 
zein promoter for expression specifically in the endosperm in a plant transformation plasmid. 
The fusion is delivered to the corn tissue via Agrobacterium transfection. 

Example 27 

Production of glucose from corn flour using thermophilic enzymes expressed in corn 

Expression of the hyperthermophiiic ct-amylase, 797GL3 and a-glucosidase (MalA) were 
shown to result in production of glucose when mixed with an aqueous solution and incubated at 

90 °C 

A transgenic corn line (line 168A10B, pNOV4831) expressing MalA enzyme was 
identified by measuring a-glucosidase activity as indicated by hydrolysis of p-nitrophenyl-a- 
glucoside. 
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Com kernels from transgenic plants expressing 797GL3 were ground to a flour in a Kleco 
cell thus creating amylase flour. Corn kernels from transgenic plants expressing MalA were 
ground to a flour in a Kleco cell thus creating MalA flour Non-transgenic corn kernels were 
ground in the same manner to generate control flour. 

Buffer was 50 mM MES buffer pH 6.0. 
Com flour hydrolysis reactions : Samples were prepared as indicated in Table 5 below. Corn 
flour (about 60 mg per sample) was mixed with 40 ml of 50 mM MES buffer, pH 6.0. Samples 
were incubated in a water bath set at 90°C for 2.5 and 14 hours. At the indicated incubation 
times, samples were removed and analyzed for glucose content. 

The samples were assayed for glucose by a glucose oxidase / horse radish peroxidase 
based assay. GOPOD reagent contained: 0.2 mg/ml o-dianisidine, 100 mM Tris pH 7.5 , 100 
U/ml glucose oxidase & 10 U/ml horse radish peroxidase. 20 nl of sample or diluted sample 
were arrayed in a 96 well plate along with glucose standards (which varied from 0 to 0.22 
mg/ml). 100 pi of GOPOD reagent was added to each well with mixing and the plate incubated 
at 37 °C for 30 min. 100 \\\ of sulfuric acid (9M) was added and absorbance at 540 run was read. 
The glucose concentration of the samples was determined by reference to the standard curve. 
The quantity of glucose observed in each sample is indicated in Table 5. 
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Table 5 



Sample 


WT flour 


amylase 


MalA 


Buffer 


Glucose 


Glucose 






flour 


flour 




2.5 h 


14 h 




mg 


mg 


Mg 


ml 


mg 


mg 


1 


66 


0 


0 


40 


0 


0 


2 


31 


30 


0 


40 


0.26 


0.50 


3 


30 


0 


31.5 


40 


0 


0.09 


4 


0 


32.2 


30.0 


40 


2.29 


12.30 


5 


0 


6.1 


56.2 


40 


1.16 


8.52 



These data demonstrate that when expression of hyperthermophilic a-amylase and a- 
glucosidase in corn result in a corn product that will generate glucose when hydrated and heated 
under appropriate conditions. 

Example 28 

Production of Maltodextrins 
Grain expressing thermophilic a-amylase was used to prepare maltodextrins. The 
exemplified process does not require prior isolation of the starch nor does it require addition of 
exogenous enzymes. 

Corn kernels from transgenic plants expressing 797GL3 were ground to a flour in a Kleco 
cell to create "amylase flour". A mixture of 10% transgenic/90% non-transgenic kernels was 
ground in the same manner to create "10% amylase flour." 

Amylase flour and 10% amylase flour (approximately 60 mg/sample) were mixed with 
water at a rate of 5 |il of water per mg of flour. The resulting slurries were incubated at 90°C for 
up to 20 hours as indicated in Table 6. Reactions were stopped by addition of 0.9 ml of 50 mM 
EDTA at 85°C and mixed by pipetting. Samples of 0.2 ml of slurry were removed, centrifuged 
to remove insoluble material and diluted 3x in water. 

The samples were analyzed by HPLC with ELSD detection for sugars and maltodextrins. The 
gradient HPLC system was equipped with Astec Polymer Amino Column, 5 micron particle size, 
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250 X 4,6 mm and an Alltech ELSD 2000 detector. The system was pre-equilibrated with a 
15:85 mixture of watenacetonitrile. The flow rate was 1 ml/min. The initial conditions were 
maintained for 5 min after injection followed by a 20 min gradient to 50:50 watenacetonitrile 
followed by 10 minutes of the same solvent. The system was washed with 20 min of 80:20 
watenacetonitrile and then re-equilibrated with the starting solvent. 

The resulting peak areas were normalized for volume and weight of flour. The response 
factor of ELSD per jig of carbohydrate decreases with increasing DP, thus the higher DP 
maltodextrins represent a higher percentage of the total than indicated by peak area. 

The relative peak areas of the products of reactions with 100% amylase flour are shown 
in Figure 17. The relative peak areas of the products of reactions with 10% amylase flour are 
shown in Figure 18. 

These data demonstrate that a variety of maltodextrin mixtures can be produced by 
varying the time of heating. The level of ct-amylase activity can be varied by mixing transgenic 
a-amylase-expressing corn with wild-type corn to alter the maltodextrin profile. 

The products of the hydrolysis reactions described in this example can be concentrated 
and purified for food and other applications by use of a variety of well defined methods 
including: centrifugation, filtration, ion-exchange, gel permeation, ultrafiltration, nano filtration, 
reverse osmosis, decolorizing with carbon particles, spray drying and other standard techniques 
known to the art. 

Example 29 

Effect of time and temperature on maltodextrin production 
The composition of the maltodextrin products of autohydrolysis of grain containing 
thermophilic a-amylase may be altered by varying the time and temperature of the reaction. 

In another experiment, amylase flour was produced as described in Example 28 above 
and mixed with water at a ratio of 300jal water per 60 mg flour. Samples were incubated at 70°, 
80°, 90°, or 100° C for up to 90 minutes. Reactions were stopped by addition of 900ml of 50mM 
EDTA at 90°C, centrifuged to remove insoluble material and filtered through 0.45nm nylon 
filters. Filtrates were analyzed by HPLC as described in Example 28. 
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The result of this analysis is presented in Figure 19. The DP number nomenclature refers 
to the degree of polymerization. DP2 is maltose; DP3 is maltotriose, etc. Larger DP 
maltodextrins eluted in a single peak near the end of the elution and are labeled ">DP12". This 
aggregate includes dextrins that passed through 0.45 |im filters and through the guard column 
and does not include any very large starch fragments trapped by the filter or guard column. 

This experiment demonstrates that the maltodextrin composition of the product can be 
altered by varying both temperature and incubation time to obtain the desired 
maltooligosaccharide or maltodextrin product. 

Example 30 

Maltodextrin production 

The composition of maltodextrin products from transgenic maize containing thermophilic 
a-amylase can also be altered by the addition of other enzymes such as a -glucosidase and xylose 
isomerase as well as by including salts in the aqueous flour mixture prior to treating with heat. 

In another, amylase flour, prepared as described above, was mixed with purified MalA 
and/or a bacterial xylose isomerase, designated BD8037. S. sulfotaricus MalA with a 6His 
purification tag was expressed in E, coli. Cell lysate was prepared as described in Example 28, 
then purified to apparent homogeneity using a nickel affinity resin (Probond, Invitrogen) and 
following the manufacturer's instructions for native protein purification. Xylose isomerase 
BD8037 was obtained as a lyophilized powder from Di versa and resuspended in 0.4x the original 
volume of water. 

Amylase corn flour was mixed with enzyme solutions plus water or buffer. All reactions 
contained 60 mg amylase flour and a total of 600|il of liquid. One set of reactions was buffered 
with 50 mM MOPS, pH 7.0 at room temperature, plus lOmM MgS04 and 1 mM CoCl 2 ; in a 
second set of reactions the metal-containing buffer solution was replaced by water. All reactions 
were incubated for 2 hours at 90°C. Reaction supernatant fractions were prepared by 
centrifugation. The pellets were washed with an additional 600jal H 2 0 and re-centrifuged. The 
supernatant fractions from each reaction were combined, filtered through a Centricon 10, and 
analyzed by HPLC with ELSD detection as described above. 
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The results are graphed in Figure 20. They demonstrate that the grain-expressed amylase 
797GL3 can function with other thermophilic enzymes, with or without added metal ions, to 
produce a variety of maltodextrin mixtures from corn flour at a high temperature. In particular, 
the inclusion of a glucoamylase or a-glucosidase may result in a product with more glucose and 
other low DP products. Inclusion of an enzyme with glucose isomerase activity results in a 
product that has fructose and thus would be sweeter than that produced by amylase alone or 
amylase with a-glucosidase. In addition the data indicate that the proportion of DP5, DP6 and 
DP7 maltooligosaccharides can be increased by including divalent cationic salts, such as CoCl 2 
and MgS0 4 . 

Other means of altering the maltodextrin composition produced by a reaction such as that 
described here include: varying the reaction pH, varying the starch type in the transgenic or non- 
transgenic grain, varying the solids ratio, or by addition of organic solvents. 

Example 31 

Preparing dextrins, or sugars from grain without mechanical 
disruption of the grain prior to recovery of starch-derived products 
Sugars and maltodextrins were prepared by contacting the transgenic grain expressing the 
ot-amylase, 797GL3, with water and heating to 90°C overnight (>14 hours). Then the liquid was 
separated from the grain by filtration. The liquid product was analyzed by HPLC by the method 
descnbed in Example 15. Table 6 presents the profile of products detected. 
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Table 6 



Molecular species 


Concentration of Products 
Hg/25 )il injection 


Fructose 


0.4 


Glucose 


18.0 


Maltose 


56.0 


DP3* 


26.0 


DP4* 


15.9 


DPS* 


11.3 


DP6* 


5.3 


DP7* 


1.5 



* Quantification of DP3 includes maltotriose and may include isomers of maltotriose that 
have an a(!->6) bond in place of an a(l->4) bond. Similarly DP4 to DP7 quantification 
includes the linear maltooligosaccarides of a given chain length as well as isomers that have 
one or more a(l-*6) bonds in place of one or more a(l-»4) bonds 

These data demonstrate that sugars and maltodextrins can be prepared by contacting 
intact ot-amylase-expressing grain with water and heating. The products can then be separated 
from the intact grain by filtration or centrifugation or by gravitational settling. 

Example 32 

Fermentation of raw starch in corn expressing Rhizopus or\>zae glucoamviase. 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 29. The kernels are ground to a flour. The corn kernels express a protein that contains 
an active fragment of the glucoamylase of Rhizopus oryzae (Sequence ID NO: 49) targeted to the 
endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 15. Then a mash is 
prepared containing s 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 

ill 



WO 2(105/096804 



PCT/US2004/007182 



components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
mash is then inoculated with yeast (1.44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86°C; at 48 hours it is set to 82 °C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 33 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The kernels are ground to a flour. The corn kernels express a protein that contains 
an active fragment of the glucoamylase of Rhizopus oryzae (Sequence DD NO: 49) targeted to the 
endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 15, Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
mash is then inoculated with yeast (1.44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 34 

Example of fermentation of raw starch in whole kernels of com expressing 
Rhizopus oryzae glucoamylase with addition of exogenous a-amvlase 
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Transgenic com kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Rhizopus oryzae (Sequence ID NO: 49) targeted to the endoplasmic reticulum. 

The com kernels are contacted with 20 g of com flour, 23 ml of de-ionized water, 6.0 ml 
of backset (8% solids by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. 
The following components are added: barley a-amylase purchased from Sigma (2 mg), protease 
(0.60 ml of a 1,000-fold dilution of a commercially available protease), 0.2 mg Lactocide & urea 
(0.85 ml of a 1 0-fold dilution of 50% Urea Liquor). A hole is cut into the cap of the 100 ml 
bottle containing the mixture in order to allow C0 2 to vent. The mixture is then inoculated with 
yeast (1 .44 ml) and incubated in a water bath set at 90° C. After 24 hours of fermentation the 
temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 35 

Fermentation of raw starch in com expressing Rhizopus oryzae 

elucoamvlase and Zea mays amylase 
Transgenic com kernels are harvested from transgenic plants made as described in 
Example 28. The com kernels express a protein that contains an active fragment of the 
glucoamylase of Rhizopus oryzae (Sequence ID NO:49) targeted to the endoplasmic reticulum. 
The kernels also express the maize amylase with raw starch binding domain as described in 
Example 28. 

The com kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of com flour. 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
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mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90 F. After 24 
hours of fermentation the temperature is lowered to 86 F; at 48 hours it is set to 82 F. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 36 

Example of fermentation of raw starch in corn expressing 
Thermoanaerobacter thermosaccharolvticum glucoamvlase . 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Thermoanaerobucter thermosaccharolyticum (Sequence ID NO: 47) targeted to 
the endoplasmic reticulum. 

The com kernels are ground to a flour as described in Example 1 5. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 37 

Example of fermentation of raw starch in com expressing 

Aspergillus niger glucoamvlase 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Aspergillus niger (Fiil,N.P. "Glucoamylases Gl and G2 from Aspergillus niger 
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are synthesized from two different but closely related mRNAs" EMBO J. 3 (5), 1097-1 102 
(1984), Accession number P04064). The maize-optimized nucleic acid encoding the 
glucoamylase has SEQ ID NO:59 and is targeted to the endoplasmic reticulum. 

The com kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
mash is then inoculated with yeast (1.44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 38 

Example of fermentation of raw starch in corn expressing 
Aspergillus niger glucoamylase and Zea mays amylase 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The com kernels express a protein that contains an active fragment of the 
glucoamylase of Aspergillus niger (Fiil.N.P. "Glucoamylases Gl and G2 from Aspergillus niger 
are synthesized from two different but closely related mRNAs" EMBO J. 3 (5), 1097-1 102 
(1984) : Accession number P04064)(SEQ ID NO:59, maize-optimized nucleic acid) and is 
targeted to the endoplasmic reticulum. The kernels also express the maize amylase with raw 
starch binding domain as described in example 28. 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
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A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 39 

Example of fermentation of raw starch in corn expressing 
Thermoanaerobacter thermosaccharolvticum glucoamvlase and barley amylase 

Transgenic com kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Thermoanaerobacter thermosaccharolyticum (Sequence ID NO: 47) targeted to 
the endoplasmic reticulum. The kernels also express the low pi barley amylase amyl gene 
(Rogers.J.C. and Milliman,C. "Isolation and sequence analysis of a barley alpha-amylase cDNA 
clone" J. Biol. Chem. 258 (13), 8169-8174 (1983) modified to target expression of the protein to 
the endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10- fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
mash is then inoculated with yeast ( 1 .44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 
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Example 40 

Example of fermentation of raw starch in whole kemals of corn expressing Thermoanaerobacier 

thermosaccharolyticum glucoamylase and barley amylase, 
Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Thermoanaerobacter thermosaccharolyticum (Sequence ID NO: 47) targeted to 
the endoplasmic reticulum. The kernels also express the low pi barley amylase amyl gene 
(RogersJ.C. and Milliman,C. "Isolation and sequence analysis of a barley alpha-amylase cDNA 
clone" J. Biol. Chem. 258 (13), 8169-8174 (1983) modified to target expression of the protein to 
the endoplasmic reticulum. 

The corn kernels are contacted with 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml 
of backset (8% solids by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. 
The following components are added to the mixture: protease (0.60 ml of a 1,000-fold dilution 
of a commercially available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 
50% Urea Liquor), A hole is cut into the cap of the 1 00 ml bottle containing the mash to allow 
CO2 to vent. The mixture is then inoculated with yeast (1.44 ml) and incubated in a water bath 
set at 90° C. After 24 hours of fermentation the temperature is lowered to 86° C; at 48 hours it is 
set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 
Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 41 

Example of fermentation of raw starch in com expressing an 
alpha-amylase and glucoamylase fusion. 
Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a maize-optimized polynucleotide such as provided in 
SEQ ID NO: 46, encoding an alpha-amylase and glucoamylase fusion, such as provided in SEQ 
ID NO: 45, which are targeted to the endoplasmic reticulum. . 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
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by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90° C After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 42 

Construction of transformation vectors 

Expression cassettes were constructed to express the hyperthermophilic beta-glucanase 

EglA in maize as follows: 

pNOV4800 comprises the barley Amy32b signal peptide 

(MGKNGNLCCFSLLLLLLAGLASGHQ) fused to the synthetic gene for the EglA beta- 
glucanase for targeting to the endoplasmic reticulum and secretion into the apoplast. The fusion 
was cloned behind the maize y-zein promoter for expression specifically in the endosperm. 

pNOV4803 comprises the barley Amy32b signal peptide fused to the synthetic gene for the 
EglA beta-glucanase for targeting to the endoplasmic reticulum and secretion into the apoplast. 
The fusion was cloned behind the maize ubiquitin promoter for expression throughout the plant. 

Expression cassettes were constructed to express the thermophilic beta-glucanase/mannanase 
6GP1 (SEQ ID NO: 85) in maize as follows: 
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pNOV4819 comprises the tobacco PRla signal peptide 

(MGFVLFSQLPSFLLVSTLLLFLVISHSCRA) fused to the synthetic gene for the 6GP1 beta- 
glucanase/mannanase for targeting to the endoplasmic reticulum and secretion into the apoplast. 
The fusion was cloned behind the maize y-zein promoter for expression specifically in the 
endosperm. 

pNOV4820 comprises the synthetic gene for 6GP1 cloned behind the maize y-zein promoter for 
cytoplasmic localization and expression specifically in the endosperm. 

pNOV4823 comprises the tobacco PRla signal peptide fused to the synthetic gene for the 6GP1 
beta-glucanase/mannanase with a C-terminal addition of the sequence KDEL for targeting to and 
retention in the endoplasmic reticulum. The fusion was cloned behind the maize y-zein promoter 
for expression specifically in the endosperm. 

pNOV4825 comprises the tobacco PRla signal peptide fused to the synthetic gene for the 6GP1 
beta-glucanase/mannanase with a C-terminal addition of the sequence KDEL for targeting to and 
retention in the endoplasmic reticulum. The fusion was cloned behind the maize ubiquitin 
promoter for expression throughout the plant. 

Expression cassettes were constructed to express the barley Amyl alpha-amylase (SEQ ID NO: 
87) in maize as follows: 

pNOV4867 comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase with a C-terminal addition of the sequence SEKDEL for targeting to and retention 
in the endoplasmic reticulum. The fusion was cloned behind the maize y-zein promoter for 
expression specifically in the endosperm. 

pNOV4879 comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase with a C-terminal addition of the sequence SEKDEL for targeting to and retention 
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in (he endoplasmic reticulum. The fusion was cloned behind the maize globulin promoter for 
expression specifically in the embryo. 

pNOV4897 comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase for targeting to the endoplasmic reticulum and secretion into the apoplast. The 
fusion was cloned behind the maize globulin promoter for expression specifically in the embryo. 

pNOV489S comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase for targeting to the endoplasmic reticulum and secretion into the apoplast. The 
fusion was cloned behind the maize y-zein promoter for expression specifically in the endosperm 

pNOV4901 comprises the gene for the barley Amyl alpha-amylase cloned behind the maize 
globulin promoter for cytoplasmic localization and expression specifically in the embryo. 

Expression cassettes were constructed to express the Rhizopus glucoamylase (SEQ ID NO: 50) 
in maize as follows: 

pNOV4872 comprises the maize y-zein N-terminal signal sequence fused to the synthetic gene 
for Rhizopus glucoamylase with a C-terminal addition of the sequence SEKDEL for targeting to 
and retention in the endoplasmic reticulum. The fusion was cloned behind the maize y-zein 
promoter for expression specifically in the endosperm. 

pNOV4880 comprises the maize y-zein N-terminal signal sequence fused to the synthetic gene 
for Rhizopus glucoamylase with a C-terminal addition of the sequence SEKDEL for targeting to 
and retention in the endoplasmic reticulum. The fusion was cloned behind the maize globulin 
promoter for expression specifically in the embryo. 

pNOV4889 comprises the maize y-zein N-terminal signal sequence fused to the synthetic gene 
for Rhizopus glucoamylase for targeting to the endoplasmic reticulum and secretion into the 
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apoplast, The fusion was cloned behind the maize globulin promoter for expression specifically 

in the embryo. 

pNOV4890 comprises the maize y-zein N-terminal signal sequence ftised to the synthetic gene 
for Rhizopus glucoamylase for targeting to the endoplasmic reticulum and secretion into the 
apoplast. The fusion was cloned behind the maize y-zein promoter for expression specifically in 
the endosperm. 

pNOV4891 comprises the synthetic gene for Rhizopus glucoamylase cloned behind the maize y- 
zein promoter for cytoplasmic localization and expression specifically in the endosperm. 

Example 43 

Expression of the mesophilic Rhizopus glucoamylase in corn 

A variety of constructs were generated for the expression of the Rhizopus glucoamylase 
in corn. The maize y-zein and globulin promoters were used to express the glucoamylase 
specifically in the endosperm or embryo, respectively. In addition, the maize y-zein signal 
sequence and a synthetic ER retention signal were used to regulate the subcellular 
localization of the glucoamylase protein. All 5 constructs (pNOV4872, pNOV4880, 
pNOV4889, pNOV4890, and pNOV4891) yielded transgenic plants with glucoamylase 
activity detected in the seed. Tables 7 and 8 show the results for individual transgenic seed 
(construct pNOV4872) and pooled seed (construct pNOV4889), respectively. No 
detrimental phenotype was observed for any transgenic plants expressing this Rhizopus 
glucoamylase. • 

Glucoamylase assay: Seed were ground to a flour and the flour was suspended in water. 
The samples were incubated at 30 degrees for 50 minutes to allow the glucoamylase to react 
with the starch. The insoluble material was pelleted and the glucose concentration was 
determined for the supernatants. The amount of glucose liberated in each sample was taken 
as an indication of the level of glucoamylase present. Glucose concentration was determined 
by incubating the samples with GOHOD reagent (300mM Tris/Cl pH7.5, glucose oxidase 
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(20U/ml), horseradish peroxidase (20U/m!), o-dianisidine 0.1 mg/ml) for 30 minutes at 37 
degrees C, adding 0.5 volumes of 12N H2S04, and measuring the OD540. 

Table 7 shows activity of the Rhizopus glucoamylase in individual transgenic corn seed 
(construct pNOV4872). 



Table 7 






U/g 


Seed 


flour 


Wild Type #1 


0.07 


Wild Type #2 


0.55 


Wild Type #3 


0.25 


Wild Type #4 


0.33 


Wild Type #5 


0.30 


Wild Type #6 


0.42 


Wild Type #7 


-0.01 


Wild Type #8 


0.31 


MD9L022156#1 


5.17 


MD9L022156 #2 


1.66 


MD9L022156 #3 


7.66 


MD9L022156 #4 


1.77 


MD9L022156 #5 


7.08 


MD9L022156 #6 


4.46 


MD9L022156 #7 


2.20 


MD9L022156 #8 


3.50 


MD9L023377 #1 


9.23 


MD9L023377 #2 


4.30 


MD9L023377 #3 


6.72 


MD9L023377 #4 


3.35 


MD9L023377 #5 


0.56 


MD9L023377 #6 


4.79 


MD9L023377 #7 


4.60 


MD9L023377 #8 


6.01 


MD9L023043 #1 


4.93 


MD9L023043 #2 


8.74 


MD9L023043 #3 


2.70 


MD9L023043 #4 


0.72 


MD9L023043 #5 


3.33 


MD9L023043 #6 


3.53 


MD9L023043 #7 


3.94 


MD9L023043 #8 


11.51 
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MD9L023334 #1 


4.28 


MD9L023334 #2 


2.86 


MD9L023334 #3 


0.56 


MD9L023334 #4 


6,96 


MD9L023334 #5 


3.29 


MD9L023334 #6 


3.18 


MD9L023334 #7 


4.57 


MD9L023334 #8 


ai S 

7.44 


MD9L022039 #1 


6.25 


MD9L022039 #2 


2.85 


MD9L022039 #3 


4.32 


MD9L022039 #4 


2.51 


MD9L022039#5 


5.06 


MD9L022039 #6 


5.03 


MD9L022039 #7 


2.79 


MD9L022039 #8 


2.98 



Tabic 8 shows activity of the Rhizopus glucoamylase in pooled transgenic corn seed 
(construct pNOV4889). 



Table 8 



Seed 


U/g 
flour 


Wild Type 


0.38 


MD9L023347 


2.14 


MD9L023352 


2.34 


MD9L023369 


1.66 


MD9L023469 


1.42 


MD9L023477 


1.33 


MD9L023482 


1.95 


MD9L023484 


1.32 


MD9L024170 


1.35 


MD9L024177 


1.48 


MD9L024184 


1.60 


MD9L024186 


1.34 


MD9L024196 


1.38 


MD9L024228 


1.69 


MD9L024263 


1.70 


MD9L024315 


1.32 
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MD9L024325 1.73 
MD9L024333 1.41 
MD9L0243391 1.84 

All expression cassettes were inserted into the binary vector pNOV21 17 for 
transformation into maize via Agrobacterium infection. The binary vector contained the 
phosphomannose isomerase (PMI) gene which allows for selection of transgenic cells with 
mannose. Transformed maize plants were either self-pollinated or outcrossed and seed was 
collected for analysis. 



Example 44 

Expression of the hyperthermophilic beta-glucanase EglA in corn 

For expression of the hyperthermophilic beta-glucanase EglA in corn we utilized the 
ubiquitin promoter for expression throughout the plant and the y-zein promoter for expression 
specifically in the endosperm of corn seed. The barley Amy32b signal peptide was fused to 
EglA for localization in the apoplast. 

Expression of the hyperthermophilic beta-glucanase EglA in transgenic corn seed and leaves was 
analysed using an enzymatic assay and western blotting. 

Transgenic seed segregating for construct pNOV4800 or pNOV4803 were analysed using 
both western blotting and an enzymatic assay for beta-glucanase. Endosperm was isolated from 
individual seed after soaking in water for 48 hours. Protein was extracted by grinding the 
endosperm in 50mM NaP04 buffer (pH 6.0). Heat -stable proteins were isolated by heating the 
extracts at 100 degrees C for 15 minutes, followed by pelleting of the insoluble material. The 
supernatant containing heat-stable proteins was analysed for beta glucanase activity using the 
azo-barley glucan method (megazyme). Samples were pre-incubated at 100 degrees C for 10 
minutes and assayed for 10 minutes at 100 degrees C using the azo-barley glucan substrate. 
Following incubation, 3 volumes of precipitation solution were added to each sample, the 
samples were centrifuged for 1 minute, and the OD590 of each supernatant was determined. In 
addition, 5ug of protein were separated by SDS-PAGE and blotted to nitrocellulose for western 
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blot analysis using antibodies against the EglA protein. Western blot analysis detected a 
specific, heat-stable protein(s) in the EglA positive endosperm extracts, and not in negative 
extracts. The western blot signal correlates with the level of EglA activity detected 
enzymatically. 

EglA activity was analysed in leaves and seed of plants containing the transgenic 
constructs pNOV4803 and pNOV4800, respectively. The assays (conducted as described above) 
showed that the heat-stable beta-glucanase EglA was expressed at various levels in the leaves 
(Table 9) and seed (Table 10) of transgenic plants while no activity was detected in non- 
transgcnic control plants. Expression of EglA in corn utilizing constructs pNOV4800 and 
pNOV4803 did not result in any detectable negative phenotype. 

Table 9 shows the activity of the hyperthermophilic beta-glucanase EglA in leaves of 
transgenic corn plants. Enzymatic assays were conducted on extracts from leaves of pNOV4803 
transgenic plants to detect hyperthermophilic beta-glucanase acitivity. Assays were conducted at 
100 degrees C using the azo-barley glucan method (megazyme). The results indicate that the 
transgenic leaves have varying levels of hyperthermophilic beta-glucanase activity. 



Tabic 9 




Plant 


Abs590 


Wild Type 


0 


266A-17D 


0.008 


266A-18E 


0.184 


266A-13C 


0.067 


266A-15E 


0.003 


266 A- 11 E 


0 


265C-1B 


0.024 


265C-1C 


0.065 


265C-2D 


0.145 


265C-5C 


0.755 


265C-5D 


0.133 


265C-3A 


0.076 


266A-4B 


0.045 


266A-12B 


0.066 


266 A- 11 C 


0.096 
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266A-14B 
266A-4C 
266A-4A 
266A-12A 
266A-15B 
266A-11A 
266A-20C 
266A-19D 
266A-12C 
266A-4E 
266A-18B 
265C-3D 
266A-20E 
266A-13D 
265C-3B 
266A-15A 
266A-13A 
265C-3E 
266A-20A 
266A-20B 
266A-19C 
266A-20D 
266A-4D 
266A-18A 
265C-5E 
266A-17E 
266A-11B 
265C-4E 
265C-4D 



0.074 
0.107 
0.084 
0.054 
0.052 
0.109 
0.044 
0.02 
0.098 
0.248 
0.367 
0.066 
0.163 
0.084 
0.065 
0.131 
0.169 
0.116 
0.365 
0.521 
0.641 
0.561 
0.363 
0.676 
0.339 
0.221 
0.251 
0.138 
0.242 
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Table 10 shows the activity of the hyperthermophilic beta-glucanase EglA in seed of 
transgenic corn plants. Enzymatic assays were conducted on extracts from individual, 
segregating seed of pNOV4800 transgenic plants to detect hyperthermophilic beta-glucanase 
acitivity. Assays were conducted at 100 degrees C using the azo-barley glucan method 
(megazyme). The results indicate that the transgenic seed have varying levels of 
hyperthermophilic beta-glucanase activity. 



Table 10 




Seed 


Abs 590 


Wild Type 


0 


1A 


1.1 


1B 


0 


1C 


1.124 


1D 


1.323 


2A 


0 


2B 


1.354 


2C 


1.307 


2D 


0 


3A 


0.276 


3B 


0.089 


3C 


0.463 


3D 


0 


4A 


0.026 


4B 


0.605 


4C 


0.599 


4D 


0.642 


5A 


1.152 


5B 


1.359 


5C 


1.035 


5D 


0 


6A 


0.006 


6B 


1.201 


6C 


0.034 


6D 


1.227 


7A 


0.465 


7B 


0 


7C 


0.366 


7D 


0.77 


8A 


1.494 


8B 


1.427 
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8C 0.003 
8D 1.413 
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Effect of transgenic expression of endoglucanase EglA on cell wall composition & in vitro 
digestibility analysis 

Five individual seed from each of two lines, #263 & #266, not expressing or expressing 
Egla (pNOV4803) respectively were grown in the greenhouse. Protein extracts made from small 
leaf samples from immature plants were used to verify that transgenic endoglucanase activity 
was present in #266 plants but not #263 plants. At full plant maturity, -30 days after pollination, 
the whole above ground plant was harvested, roughly chopped, and oven dried for 72 hours. 
Each sample was divided into 2 duplicate samples (labelled A & B respectively), and subjected 
to in vitro digestibility analysis using strained rumen fluid using common procedures (Forage 
fiber analysis apparatus, reagents, procedures, and some applications, by H. K. Goering and 
P. J. Van Soest, Goering, H. Keith 1941 (Washington, D.C.) : Agricultural Research Service, 
U.S. Dept. of Agriculture, 1970. iv, 20 p. : ill. - Agriculture handbook ; no. 379 ), except that 
material was treated by a pre-incubation at either 40°C or 90°C prior to in vitro digestibility 
analysis. In vitro digestibility analysis was performed as follows; 

Samples were chopped to about 1mm with a wiley mill, and then sub-divided into 16 
weighed aliquots for analysis. Material was suspended in buffer and incubated at either 40°C or 
90°C for 2 hours, then cooled overnight. Micronutrients, trypticase & casein & sodium sulfite 
were added, followed by strained rumen fluid, and incubated for 30 hours at 37°C. Analyses of 
neutral detergent fiber (NDF), acid detergent fiber (ADF) and acid detergent lignin (AD-L) were 
performed using standard gravimeteric methods (Van Soest & Wine, Use of Detergents in the 
Analysis of fibrous Feeds. IV. Determination of plant cell-wall constituents, P.J. Van Soest & 
R.H. Wine. (1967). Journal of The AOAC, 50: 50-55; see also Methods for dietry fiber, neutral 
detergent fiber and nonstarch polysaccharides in relation to animal nutrition (1991). P.J. Van 
Soest, J.B. Robertson & B.A. Lewis. J. Dairy Science, 74: 3583-3597.). 

Data show that transgenic plants expressing EglA (#266) contain more NDF than control 
plants (#233), whilst ADF & lignin are relatively unchanged. The NDF fraction of transgenic 
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plants is more readily digested than that of non-transgenic plants, and this is due to an increase in 
the digestibility of cellulose (NDF - ADF - AD-L), consistent with "self-digestion" of the cell- 
wall cellulose by the transgenically expressed endoglucanase enzyme. 



Transgenic seed for pNOV4820 and pNOV4823 were analysed for 6GP1 beta glucanase 
activity using the azo-barley glucan method (megazyme). Enzymatic assays conducted at 50 
degrees C indicate that the transgenic seed have thermophilic 6GP1 beta-glucanase activity while 
no activity was detected in non-transgenic seed (positive signal represents background noise 
associated with this assay). 

Table 1 1 shows activity of the thermophilic beta-glucanase/mannanase 6GP1 in 
transgenic corn seed. Transgenic seed for pNOV4820 (events 1-6) and pNOV4823 (events 7-9) 
were analysed for 6GP1 beta-glucanase activity using the azo-barley glucan method 
(megazyme). Enzymatic assays were conducted at 50 degrees C and the results indicate that the 
transgenic seed have thermophilic 6GP1 beta-glucanase activity while no activity is detected in 
non-transgenic seed. 

Table 1 1 

Seed Abs 590 

Wild Type 0 



Example 45 

Expression of the thermophilic beta-glucanase/mannanase (6GP1) in corn 



2 
3 
4 
5 
6 
7 
8 
9 



0.21 
0.31 
0.36 
0.23 
0.16 
0.14 
0.52 
0.54 
0.49 
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Expression of the mesophilic barley Amy I amylase in com 

A yariety of constructs were generated for the expression of the barley Amyl alpha- 
amylase in corn. The maize y-zein and globulin promoters were used to express the amylase 
specifically in the endosperm or embryo, respectively. In addition, the maize y-zein signal 
sequence and a synthetic ER retention signal were used to regulate the subcellular localization of 
the amylase protein. All 5 constructs (pNOV4867, pNOV4879, pNOV4897, pNOV4895, 
pNOV490l) yielded transgenic plants with alpha-amylase activity detected in the seed. Table 12 
shows the activity in individual seed for 5 independent, segregating events (constructs 
pNOV4879 and pNOV4897). All of the constructs produced some transgenic events with a 
shrivelled seed phenotype indicating that synthesis of the barley Amyl amylase could effect 
starch formation, accumulation, or breakdown. 

Table 12 shows activity of the barley Amyl alpha-amylase in individual corn seed 
(constructs pNOV4879 and pNOV4897). Individual, segregating seed for constructs pNOV4879 
(seed samples 1 and 2) and pNOV4897 (seed samples 3-5) were analysed for alpha-amylase 
activity as described previously. 



Table 12 



Seed 


U/g corn flour 


1A 


19.29 


IB 


1.49 


IC 


18.36 


ID 


1.15 


IE 


1.62 


IF 


14.99 


IG 


1.88 


IH 


1.83 


2A 


2.05 


2B 


36.79 
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Preparation of Xvlanase Constructs 

Table 1 3 lists 9 binary vectors that each contain a unique xylanase expression cassette. The 
xylanase expression cassettes include a promoter, a synthetic xylanase gene (coding 
sequence), an intron (PEPC, inverted), and a terminator (35S). 

Two synthetic maize-optimized endo-xylanase genes were cloned into binary vector 
pNOV21 17. These two xylanase genes were designated BD7436 (SEQ ID NO: 61) and 
BD6002 A (SEQ ID NO:63). Additional binary vectors containing a third maize-optimized 
sequence, BD6002B (SEQ ID NO:65) can be made. 

Two promoters were used: the maize gIutelin-2 promoter (27-kD gamma-zein promoter 
(SEQ ID NO: 12 ) and the rice glutelin-l (Osgtl) promoter (SEQ ID NO: 67). The first 6 
vectors listed in Table 1 have been used to generate transgenic plants. The last 3 vectors can 
also be made and used to generate transgenic plants. 

Vector 1 1560 and 1 1562 encode the polypeptide shown in SEQ ID NO: 62 (BD7436). 
Constructs 1 1559 and 11561 encode a polypeptide consisting of SEQ ID NO: 1 7 fused to the N- 
terminus of SEQ ID NO: 62. SEQ ID NO: 17 is the 19 amino acid signal sequence from the 27- 
kD gamma-zein protein. 

Vector 12175 encodes the polypeptide shown in SEQ ID NO: 64(BD6002A). Vector 
12174 encodes a fusion protein consisting of the gamma-zein signal sequence (SEQ ID NO: 1 7) 
fused to the N-terminus of SEQ ID NO: 64. 

Vectors pWIN062 and pWIN064 encode the polypeptide shown in SEQ ID NO: 
66(BD6002B), Vector pWIN058 encodes a fusion protein consisting of the chloroplast transit 
peptide of maize waxy protein (SEQ ID NO:68) fused to the N-terminus of SEQ ID NO: 66 . 
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Table 13 Xylanase binary vectors 



vector 


Promoter 


otgnal Sequence source 


Ayianase oene 


1 1559 


27kD Gamma-zein 


27kD Gamma-zein 


BD7436 


11560 


27kD Gamma-zein 


None 


BD7436 


11561 


OsGtl 


21 kD Gamma-zein 


BD7436 


11562 


OsGtl 


None 


BD7436 


12174 


27lcD Gamma-zein 


27kD Gamma-zein 


BD6002A 


12175 


27kD Gamma-zein 


None 


BD6002A 


PWIN058 


27kD Gamma-zein 


Maize waxy protein 


BD6002B 


PWIN062 


OsGtl 


None 


BD6002B 


P\vrN064 


27kD Gamma-zein 


None 


BD6002B 



All constructs include an expression cassette for PMI, to allow positive selection of 
regenerated transgenic tissue on mannose-containing media. 

Example 48 

Xvlanase Activity Assay Results 

The data shown in Tables 14 and 15 demonstrate that xylanase activity accumulates in Tl 
generation seed harvested from regenerated (TO) maize plants stably transformed with binary 
vectors containing xylanase genes BD7436 (SEQ ID NO: 61 in Example 47) and BD6002A 
(SEQ ID NO:63 in Example 47). Using an Azo-WAXY assay (Megazyme), activity was 
detected in extracts from both pooled (segregating) transgenic seed and single transgenic seed. 

Tl seed were pulverized and soluble proteins were extracted from flour samples using 

citrate-phosphate buffer (pH 5.4). Flour suspensions were stirred at room temperature for 60 

minutes, and insoluble material was removed by centrifiigation. The xylanase activity of the 

supernatant fraction was measured using the Azo-WAXY assay (McCleary, B.V. "Problems in 

the measurement of beta-xylanase, beta-glucanase and alpha-amylase in feed enzymes and 

animal feeds". In proceedings of Second European Symposium on Feed Enzymes'* (W.van 

Hartingsveldt, M. Hessing, J.P. van der Jugt, and W.A.C Somers Eds.), Noordwiijkerhout, 

Netherlands, 25-27 October, 1995). Extracts and substrate were pre-incubated at 37°C. To 1 

volume of IX extract supernatant, 1 volume of substrate (1% Azo-Wheat Arabinoxylan S- 

AWAXP) was added and then incubated at 37°C for 5 minutes. Xylanase activity in the com 
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flour extract depolymerizes the Azo-Wheat Arabinoxylan by an endo-mechanism and produces 
low molecular weight dyed fragments in the form of xylo-oligomers. After the 5 minute 
incubation, the reaction was terminated by the addition of 5 volumes of 95% EtOH. Addition of 
alcohol causes the non-depolymerized dyed substrate to precipitate so that only the lower 
molecular weight xylo-oligomers remain in solution. Insoluble material was removed by 
centrifiigation. The absorbance of the supernatant fraction was measured at 590nm, and the units 
of xylanase per gram of flour were determined by comparison to the absorbance values from 
identical assays using a xylanase standard of known activity. The activity of this standard was 
determined by a BCA assay, The enzyme activity of the standard was determined using wheat 
arabinoxylan as substrate and measuring the release of reducing ends by reaction of the reducing 
ends with 2,2'-bicinchoninic acid (BCA). The substrate was prepared as a 1 .4% w/w solution of 
wheat arabinoxylan (Megazymc P-WAXYM) in 100 mM sodium acetate buffer pH5.30 
containing 0.02% sodium azide. The BCA reagent was prepared by combining 50 parts reagent 
A with 1 part reagent B (reagents A and B were from Pierce, product numbers 23223 and 23224, 
respectively). These reagents were combined no more than four hours before use. The assay 
was performed by combining 200 microliters of substrate to 80 microliters of enzyme sample. 
After incubation at the desired temperature for the desired length of time, 2.80 milliliters of BCA 
reagent was added. The contents were mixed and placed at 80°C for 30-45 minutes. The 
contents were allowed to cool and then transferred to cuvettes and the absorbance at 560nm was 
measured relative to known concentrations of xylose. The choice of enzyme dilution, incubation 
time, and incubation temperature could be varied by one skilled in the art. 

The experimental results shown in Table 14 demonstrate the presence of recombinant 
xylanase activity in flour prepared from Tl generation corn seed. Seed from 12 TO plants 
(derived from independent T-DNA integration events) were analyzed. The 12 transgenic events 
were derived from 6 different vectors as indicated (refer to Table 13 in Example 47 for 
description of vectors). Extracts of non-transgenic (negative control) corn flour do not contain 
measurable xylanase activity (see Table 1 5). The xylanase activity in these 12 samples ranged 
from 10-87 units/gram of flour. 
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Table 14. Analysis of pooled Tl seed. 



Vprtnr 

v ctiur 


oampie 


Y\/l a nncp I fnitc / fJrQtn rvi FIrviir 

Ayidiidbc \j luio f vjidiTi wi nuui 


i i Jjy 






1 1 <so 

l i j j J 


MHQI fi1949fl 






N/inQI m 1 9Qfi 


0*5 


1 1 JOU 


MHQI 01 1 **99 


91 


1 1 Jul 


MDQI 019411 


87 


11561 


MD9L012443 


83 


11562 


MD9L012890 


13 


11562 


MD9L013788 


12 


12174 


MD9L022080 


16 


12174 


MD9L022195 


10 


12175 


MD9L022061 


74 


12175 


MD9L022134 


69 



The results in Table 1 5 demonstrate the presence of xylanase activity in corn flour 
derived from single kernels. Tl seed from two TO plants containing vectors 1 1561 and 1 1559 
were analyzed. These vectors are described in Example 47. Eight seed from each of the two 
plants were pulverized and flour samples from each seed were extracted. The table shows results 
of single assays of each extract. No xylanase activity was found in assays of extracts of seeds 1, 
5, and 8 for both transgenic events. These seed represent null segregants. Seed 2, 3, 4, 6, and 7 
for both transgenic events accumulated measurable xylanase activity attributable to expression of 
the recombinant BD7436 gene. All 10 seed that tested positive for xylanase activity (>10 
unit/gram flour) had an obvious shriveled or shrunken appearance. By contrast the 6 seed that 
tested negative for xylanase activity (<1 unit/gram flour) had a normal appearance. This result 
suggests that the recombinant xylanase depolyrnerized endogenous (arabino)xylan substrate 
during seed development and/or maturation. 



Table 15. Analysis of single Tl seed. 



Vector 11561 



Vector 11559 
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Seed 


Xylanase Units/ 


Seed 


Xylanase Units / 


Number 


Gram of Flour 


Number 


Gram or Flour 


1 


0 


1 


1 


2 


45 


2 


52 


3 


38 


3 


21 


4 


40 


4 


13 


5 


0 


5 


0 


6 


40 


6 


28 


7 


32 


7 


23 


8 


0 


8 


0 



Example 49 

Enhanced starch recovery from corn seed using enzymes 

Corn wet-milling includes the steps of steeping the corn kernel, grinding the corn kernel, 
and separating the components of the kernel. A bench top assay (the Cracked Com Assay) was 
developed to mimic the corn wet-milling process 

The "Cracked Corn Assay" was used for identifying enzymes that enhance starch yield 
from maize seed resulting in an improved efficiency of the corn wet milling process. Enzyme 
delivery was either by exogenous addition, transgenic corn seed, or a combination of both. In 
addition to the use of enzymes to facilitate separation of the corn components, elimination of SO2 
from the process is also shown. 

Cracked Corn Assay. 

One gram of seed was steeped overnight in 4000, 2000, 1000, 500, 400, 40, or 0 ppm 
SO2 at 50 degrees C or 37 degrees C. Seeds were cut in half and the germ removed. Each half 
seed was cut in half again. Steep water from each steeped seed sample was retained and diluted 
to a final concentrations ranging from 400 ppm to 0 ppm SO2. Two milliliters of the steep water 
with or without enzymes was added to the de-germed seeds and the samples placed at 50 
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degrees C or 37degrees C for 2-3 hours. Each enzyme was added at 10 units per sample. All 
samples were vortexed approximately every 15 minutes. After 2-3 hours the samples were 
filtered through mira cloth into a 50ml centrifuge tube. The seeds were washed with 2 ml of 
water and the sample pooled with the first supernatant. The samples were centrifuged for 15 
minutes at 3000 rprn. Following centrifugation, the supernatant was poured off and the pellet 
placed at 37 degrees C to dry. All pellet weights were recorded. Starch and protein 
determinations ware also carried out on samples for determining the starch iprotein ratios released 
during the treatments (data not shown). 



Anaylsis of Tl and T2 seed from maize plants expressing 6GP1 endoglucanase in 

Cracked corn Assay 

Transgenic corn (pNOV4819 and pNOV4823) containing a thermostable endoglucanse 
performed well when analyzed in the Cracked Corn Assay. Recovery of starch from the 
pNOV4819 line was found to be 2 fold higher in seeds expressing the endoglucanase when 
steeped in 2000 ppm SO2. Addition of a protease and cellobiohydrolase to the endoglucanse 
seed increased the starch recovery approximately 7 fold over control seeds. See Table 16. 



Table 16. Crack Corn Assay results for cytosolic expressed 
Endoglucanase (pNOV4820). Control line, A188/HiII 
PNOV4819 lines, 42C6A-1-21 and 27. 







wmmM 


A188/HiII Control 


No Enzyme 


28.4 


A188/HiII Control 


Broaielain/C8546 10U 


109.3 








H2C6A-1-21 


No Enzyme 


S2.6 


42C6A-1-21 


Bronjelain/C8546 10U 


170.4 








42C6A-1-27 


No Enzyme 


60.5 


42C6A-1-27 


Bromelain/C8546 10U 


207.5 



Similar results 
were seen in transgenic 
seed containing 
endoglucanase targeted 
to the ERofthe 
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endosperm (pNOV4823), again resulting in a 2 -7 fold increase in starch recovery when 
compared to control seed. See Table 1 7. 



Table 17. Crack Corn Assay results for ER expressed 



endoglucanase (pNOV4823). Control line, A188/HHI; 

PNOV4823 line, 101D1 1A-1-28. 



Line 


Treatment 


Starch 
Pellet Wt 
(mg) 


Starch 
Pellet Wt 
(mg) 


Mean Wt 


A188/Hill 


No Enzyme 


22.5 


19.1 


20.8 


101O11A-1-28 


No Enzyme 


41.2 


32 


36.6 












A188/HHI 


10U Bromelian/C8546 


78.6 


73.8 


76.2 


101D11A-1-28 


10U Bromelian/C8546 


169.8 


132.6 


151.2 



These results confirm that expression of an endoglucanase enhances the separation of starch and 
protein components of the corn seed. Further more it could be shown that reduction or removal 
of S02 during the steeping process resulted in starch recovery that was comparable to or better 
than normally steeped control seeds. See Table 18. Removal of high levels of S02 from the 
wet-milling process can provide value-added benefits. 

Table 1 8. Comparison of various concentrations of S02 on starch recovery from transgenic 
6GP1 seed. 



Line 


Treatment 


Starch Pellet Wt 
(mg) 


A188 Control 


2000 ppm S02 


18.5 


JHAF Control 


2000 ppm S02 


29.1 


42C (pNOV4820) 


2000 ppm S02 


29.5 


101C(pNOV4823) 


2000 ppm S02 


73.1 


101D(pNOV4823) 


2000 ppm S02 


42.5 


136A (pNOV4825) 


2000 ppm S02 


36.6 
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137A (pNOV4825) 


2000 ppm S02 


38.6 








42C (PNOV4820) 


400 ppm S02 


18.5 


101C(pNOV4823) 


400 ppm S02 


20.4 


101D(pNOV4823) 


400 ppm S02 


39.7 


136A (pNOV4825) 


400 ppm S02 


26 


137A(pNOV4825) 


400 ppm S02 


26.9 








42C (PNOV4820) 


0 ppm S02 


21.9 


101C(pNOV4823) 


0 ppm S02 


32.5 


101D(pNOV4823) 


0 ppm S02 


39 


136A (pNOV4825) 


0 ppm S02 


17.8 


137A(pNOV4825) 


0 ppm S02 


29.2 



Example 50 

Construction of transformation vectors for maize optimized bromelain 

Expression cassettes were constructed to express the maize optimized bromelain in maize 
endosperm with various targeting signals as follows: 

pSYNl 1000 (SEQ ED NO. 73 ) comprises the bromelain signal sequence 
(N4AWKVQVVFLFLFLCVM WASPS AASA ) (SEQ ID NO: 72) and synthetic bromelain 
sequence fused with a C-terminal addition of the sequence VFAEAIAANSTLVAE for targeting 
to and retention in the PVS (Vitale and Raikhel Trends in Plant Science Vol 4 no.4 pg 149-155). 
The fusion was cloned behind the maize gamma zein promoter for expression specifically in the 
endosperm. 

pSYNl 1587 (SEQ ID NO:75) comprises the bromelain N-terminal signal sequence 
(MAWKVQVVFLFLFLCVM WASPS AASA) and synthetic bromelain sequence with a C- 
terminal addition of the sequence SEKDEL for targeting to and retention in the endoplasmic 
reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned behind the maize gamma 
zein promoter_for expression specifically in the endosperm. 

pSYNl 1 589 (SEQ ID NO. 74) comprises the bromelain signal sequence 
(M A WKVQVVFLFLFLCVM WASPS AASA) (SEQ ID NO: 72) fused to the lytic vacuolar 
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targeting sequence SSSSFADSNPIRVTDRAAST (Neuhaus and Rogers Plant Molecular 
Biology 38:127-144, 1998) and synthetic bromelain for targeting to the lytic vacuole. The 
fusion was cloned behind the maize gamma zein prmoter for expression specifically in the 
endosperm. 

pS YN 1 2 1 69 (SEQ ID NO: 76) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ [D NO: 17) fused to the synthetic bromelain for targeting 
to the endoplasmic reticulum and secretion into the apoplast (Torrent et al. 1997). The fusion 
was cloned behind the maize gamma zein promoter for expression specifically in the endosperm. 

pSYN 12575 (SEQ ID NO:77) comprises the waxy amyloplast targeting peptide (Klosgen 
et al., 1986) fused to the synthetic bromelain for targeting to the amyloplast. The fusion was 
cloned behind the gamma zein promoter for expression specifically in the endosperm. 

pSM270 ( SEQ ID NO. 78 ) comprises the bromelain N-terminal signal sequence fused 
to the lytic vacuolar targeting sequence SSSSFADSNPIRVTDRAAST (Neuhaus and Rogers 
Plant Molecular Biology 38:127-1 44, 1998) and synthetic bromelain for targeting to the lytic 
vacuole. The fusion was cloned behind the aleurone specific promoter P19 (US Patent 6392123) 
for expression specifically in the aleurone. 

Example 51 

Expression of bromelain in corn 

Seeds from Tl transgenic lines transformed with vectors containing the synthetic 

bromelain gene with targeting sequences for expression in various subcellular location of the 

seed were analyzed for protease activity. Corn-flour was made by grinding seeds, for 30 sec, in 

the Kleco grinder. The enzyme was extracted from 100 mg of flour with 1 ml of 50 mM NaOAc 

pH4.8 or 50 mM Tris pH 7.0 buffer containing ImM EDTA and 5 mM DTT. Samples were 

vortexed, then placed at 4C with continuous shaking for 30 min. Extracts from each transgenic 

line was assayed using resorufin labeled casein (Roche, Cat. No. 1 080 733) as outlined in the 

product brochure. Flour from T2 seeds were assayed using a bromelain specific assay as 

outlined in Methods in Enzymology Vol. 244: Pg 557-558 with the following modifications. 

lOOmgof corn seed flour was extracted with 1ml of 50mMNa 2 HPO4/50mM NaH 2 P0 4 , pH 7.0, 1 
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mM EDTA +/- l^M leupeptin for 15 min at 4°C. Extracts were centriftiged for 5 min at 14,000 
rpm at 4°C. Extracts were done in duplicates. .Flour from T2 Transgenic lines was assayed for 
bromelain activity using Z-Arg-Arg-NHMec (Sigma) as a substrate. Four aliquots of 100/il /corn 
seed extracts were added to 96 well flat bottom plates (Corning) containing 50/xI 
100mMNa 2 HPO 4 /100mM NaH 2 P0 4 > pH 7.0, 2mM EDTA, 8mM DTT/well. The reaction was 
started by the addition of 50fi\ of 20/xM Z-Arg-Arg-NHMec. The reaction rate was monitor 
using a SpectraFluorPlus(Tecan) fitted with a 360nm excitation and 465nm emission filters at 
40°C at 2.5min intervals. 

Table 19 shows the analysis of seed from different Tl bromelain events. Bromelain 

expression was found to be 2-7 fold higher than the A 188 and JHAF control lines. Tl 

transgenic lines were replanted and T2 seeds obtained. Analysis of T2 seeds showed 

expression of bromelain. Figure 21 shows bromelain activity assay using Z-Arg-Arg-NHMec_in 

T2 seed for ER targeted (1 1587) and lytic vacuolar targeted (1 1589) bromelain. 

Analysis ofT2 seed from maize plants expressing Bromelain 

Seed from T2 transgenic bromelain line, 1 1587-2 was analyzed in the Cracked Corn 
assay for enhanced starch recovery. Previous experiments using exogenously added bromelain 
showed an increased starch recovery when tested alone and in combination with other enzymes, 
particularly cellulases. The T2 seed from line 1 1 587-2 showed a 1.3 fold increase in starch 
recovered over control seed when steeped at 37C/2000 ppm S02 overnight. More importantly, 
there was the 2 fold increase in starch from the T2 bromelain line, 1 1587-2 when a cellulase 
(C8546) was added when seeds were steeped at 37C/2000 ppm S02. 

The transgenic line showed a similar trend in increased starch over control seed when 
seeds were steeped at 37C/400 ppm S02. A 1 .6 fold increase starch recovered over control was 
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seen in the transgenic seed and a 2.1 fold increase of starch with addition of a cellulase (C8546). 
See Table 20. 

These results are significant in showing that it is possible to reduced temperature and 
S02 levels while also enhancing the starch recovery during the wet-milling process when 
transgenic seed expressing a bromelain is used. 



Table 19 



Summary of Grain Specific Expression of Bromelain in Tl corn. 



* 

Line Number 


Tarqeting : . 


.** v . . ' .».■«*• » < 

Construct 


. ^Specific Activity^ ng 

•tBrbmelaln/protein • 


11000-1 


Vacuolar 


GZP/probromelain/barleyPVS 


252 


11000-2 


Vacuolar 


GZP/probromeiain/barleyPVS 


277 


11000-3 


Vacuolar 


GZP/probromelain/barleyPVS 


284 


11587-1 


ER 


GZP/probromelain/KDEL 


174 


11587-1 


ER 


GZP/probromelain/KDEL 


153 


11589-1 


Lytic Vacuolar 


GZP/aleurainSS/probromelain 


547 


11589-2 


Lytic Vacuolar 


GZP/aleurainSS/probromelain 


223 






A188 Control 


56 






JHAF Control 


75 



Table 20 Cracked Corn Assay results for T2 Bromelain seed 



Steep Conditions 


tine- - ' 


Starch Pellet Wt. (mjg) 


2000 ppm S02 


A188 


41.3 


2000 ppm S02 


A188/C8546 (10 units) 


44 


2000 ppm S02 


11587-2 


57.4 


2000 ppm S02 


11587-2/C8546 (10 units) 


94.6 


400 ppm 


A188 


30.7 


400 ppm 


A188/C8546 (10 units) 


35.8 


400 ppm 


11587-2 


50.5 


400 ppm 


11587-2/C8546 (10 units) 


86.6 



143 



WO 2005/096804 



PCT/US2004/007182 



Example 52 

Construction of transformation vectors for maize optimized ferulic acid esterase. 
Expression cassettes were constructed to express the maize optimize ferulic acid esterase 

in maize endosperm with or without various targeting signals as follows: 

Plasmid 13036 (SEQ ID NO: 101) comprises the maize optimize ferulic acid esterase 

(FAE) sequence (SEQ ID NO: 99). The sequence was cloned behind the maize gamma zein 

promoter without any targeting sequences for expression specifically in the cytosol of the 

endosperm. 

Plasmid 13038 (SEQ ID NO: 103) comprises the maize y-zein N-terminal signal 
sequence (MRVLLVALALLALAASATS)(SEQ ID NO: 17) ftised to the synthetic FAE for 
targeting to the endoplasmic reticulum and secretion into the apoplast (Torrent et al. 1997). The 
fusion was cloned behind the maize gamma zein promoter for expression specifically in the 

endosperm. 

Plasmid 13039 (SEQ ID NO: 105) comprises the waxy amyloplast targeting peptide 
(MLAALATSQLVATRAGLGVPDASTFRRGAAQGLRGARASAAAD 
TLSMRTSARAAPRHQHQQARRGARFPSLVVCASAGA) (KJosgen et al., 1986) fused to the 
synthetic FAE for targeting to the amyloplast. The fusion was cloned behind the gamma zein 
promoter for expression specifically in the endosperm. 

Plasmid 13347 (SEQ ID NO: 107) comprises the maize y-zein N-terminal signal 
sequence (MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic FAE sequence 
with a C-terminal addition of the sequence SEKDEL for targeting to and retention in the 
endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned behind the 
maize gamma zein promoter/or expression specifically in the endosperm. 

All expression cassettes were moved into a binary vector pNOV21 17 for transformation 
into maize via Agrobacterium infection. The binary vector contained the phosphomannose 
isomerase (PMI) gene which allows for selection of transgenic cells with mannose. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 
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Combinations of the enzymes can be produced either by crossing plants expressing the 
individual enzymes or by cloning several expression cassettes into the same binary vector to 
enable cotransformation. 



Synthetic Ferulic Acid Esterase Sequence (SEP ID NO: 99) 
atgeccecctccctcccgaccatgcceccgtccggctacgac^ 

ctccacccgcccggcccgcgtgtacctcccgccgggctactccaaggacaagaagtactccgtgc(ctacctcctccacggcatcggcggctccga gaacgactggtt 

cgagggcggcggccgcgccaacgtgatcgccgacaacctcatcgccgagggcaagatcaagccgctcatcatcgtgaccccga acaccaacgccgccggcccgg 

gcatcgccgacggctacgagaacrtcaccaaggacctcctcaactccctcatcccgtacatcgagtccaactactccgtgtacaccgaccgcgagcaccgcgccatcgc 

cggcctctctatgggcggcggccagtccttcaacatcggcctcaccaacctcgacaagttcgcctacatcggcccgatctccgcc gccccgaacacctacccgaacga 

ucgcctcttcccggacggcggcaaggccgcccgcgagaagctcaagctcctcncatcgcctgcggcacc aacgactccctcatcggcncggccagcgcgtgcacg 

aptactgcgtggccaacaacatcaaccacgtgtactggctcatccagggcggcggcca cgacttcaacgtgtggaagccgggcctctggaacttcctccagatggccg 

acgaggccggcctcacccgcgacggcaacaccccggtgccgaccccgtcccceaagccggccaacacccgcatcgaggcc gaggactacgacggcatcaactcc 

tcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcggctacatcacctccggcgacta cctcgtgtacaagtccatcgacttcggcaacggcgcc 

acctcctlcaaggccaaggtggccaacgccaacacctccaacatcgagcttcgcctcaacggcccgaacg gcaccctcatcggcaccctctccgtgaagtccaccggc 

gactggaacacctacgaggagcagacctgctccatctccaaggtgaccggcatcaacgac ctctacctcgtgttcaagggcccggtgaacatcgactggttcaccttcg 

gcgtgtag 

S ynthetic Ferulic Acid Esterase Amino Acid Sequence (SEP ID NO: 100) 

maaslptmppsgydq\Tngv^rgqv\^isvfstam^^ 

piadgvcnftkdllnslipviesnvsvvtfrehraiaglsmgggq 

vanninhvywliq g pghdfnvwkpglv\Ttflqmadeaglh-dgnfr^^^ 

sfkakvanantsnielrlnep n gtligtlsykstgd^iiWeeqtcsiskvlgindlylvfkgpvnidwfrfgv* 



13036 Sequence (SEP ID NO: 101) 

at ggccgcctccctcccgaccatgccgccgtccggctacgaccaggtgcgcaacggcg tgccgcgcggccaggtggtgaacatctcctacttctccaccgccaccaa 

ctccacccgcccggcccgcgtgtacctcccgccgggctactccaaggacaagaagtac tccgtgctctacctcctccacggcatcggcggctccgagaacgactggtt 

cgagggcggcggccgcgccaacgtgatcgccgacaacctcatcgccgagggcaagatcaagccgctcatcatc gtgaccccgaacaccaacgccgccRgcccgg 

gcatcgccgac g gctacgagaacttcaccaaggacctcctcaactccctcatcccgtacatcgagtccaactactccgtg tacaccgaccgcpapcaccgcgccatcgc 

cggcctctctatg g gcggcggccagtccttcaacatcggcctcaccaacctcgacaagttcgcctacatcggcccgatctccgccg ccccgaacacctacccgaacga 

gcgcctcnccc ggacg gcggcaaggccgcccgcgagaagctcaagctcctcttcatcgcctgcggca ccaaceactccctcatcggcttcggccagcgcglgcacg 

agtactgcgtggccaacaacatcaaccacgtgtactggctcatccagggcggcggccacgacttcaacgtgt ggaagccgggcctctggaacttcctccagatRgccg 

acgag gcc ggcctcacccgcgacggcaacaccccggtgccgaccccgtccccgaagccggccaacacccgcatcgag gccgaggactacgacggcatcaactcc 

tcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcg gctacatcacctccggcgactacctcgtgtacaagtccatcgacttcggcaacRRcgcc 

acctccncaaggccaaggtggccaacgccaacacctccaacatcgagcncgcctcaacggcccgaacggcaccctc atcggcaccctctccgteaagtccaccggc 

gact ggaacacctacgaggagcagacctgctccatctccaaggtgaccggcatcaacgacctctacctcgtgttcaagggcccggtgaacatCRactgRttcaccttcR 

gcgtgtag 

1 3036 A A Sequence (SEP ID NO: 102) 

ma3slptTnppsgvdqvTng\Trgqv\Tusyfstar^ 

piadpvenftkdllnslipviesnvsvvtdrehjaiaglsmgggqsfn^ 

vanninhv^vliqggghdfnvwkpglN^ 

sfkakvanantsnielrlngpngtligtlsvkstgd^Tirveeqtcsisk^gindlvl vflcgpvnidwMgv* 
13038 Sequence (SEP ID NO: 103) 

atgagggt gttgctcgngccctcgctctcctggctctcgctgcgagcgccacctccatggccgcctccctcc cgaccatgccgccgtccggctacgaccaggtRCRca 
acggcgtgccgcgcggccaggtg gt gaacatctcctacnctccaccgccaccaactccacccgcccggcccgcgtgt acctcccgccgggctactccaaggacaaR 
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aagtactccgtgctctacctcctccacgecatcgec£gctccgagaac^actggnc gagpgcggcggccficeccaacgteatc&ccgacaacctcatc2cceaege 

caagatcaagccgctcatcatcgtgaccccgaacaccaacgccgccggcccgggcatcgccgacggctacgagaacttcaccaaggacctcctcaactccctcatccc 

gtacatcgagtccaactactccgtgtacaccgaccgcgagcaccgcgccatcgccggcctctctatgggcggcggccagtccttcaacatcggcctcaccaacctCRac 

aaettcgcctacatcggcccgatctccgccgccccgaacacctaccceaacgagcgcctcttcccggacggcggcaaggccgcccgcgagaagctcaagctcctctt 

catcgcctgcggcaccaacgactccctcatcggctlcggccagcgcgigcacgagtactRCgtggccaacaacatcaaccacgtgtactggctcatccagggcRRCRg 

ccacgacncaacgtgtg&aagccgg&cctctggaacttcctccagatggccgacgaggccggcctcacccgcgac ggcaacaccccggtgccgaccccgtccccg 

aagccggccaacacccecatcgaggccgaggactacgacggcatcaactcctcctccatcgagatcatcggcgtgccgccggagggcggcc gcggcatcggctac 

atcacctccggcgactacctcgtgtacaagtccatcgacncggcaacggcgccacctccacaaggccaaggtggccaacgccaacacctccaacatcgagcrTcgcc 

(caacggcccgaacggcaccctcatcggcaccctctccgtgaafitccaccggcgactggaacacctacgaggagcagacctgctccatctccaaggtgaccggcatc 

aacgacc(ctacctcgtgucaagggcccggtgaacatcgactgg»caccttcggcgtgtag 

13038 A A Sequence (SEP ID NO: 104) 

mnMvalallalaasatsmaaslptTTipnsgv^ 

gkikpliivmnmaagpgiadgyenftkdtl^ 

cgmdsligfcqrvheYCvanninhvyAvliqggghdfa^^ 

(sgdvKvksidfgngatsfkakvanantsnielrlngpngtligtlsvkstgdwntveeqtcsiskvtgindlvlvfVgpvni 
1 3Q39 Sequence (SEP ID NO: 105) 

atgclggcggctctggccacgtcgcagclcgtcgcaacgcgcgccggcctgggcgtcc cggacgcgtccacgttccgccgcggcgccgcgcagggcctgagggg 

ggcccg g gcgtcggcggcgEcggacacgctcagcatgcggaccagcgcgcgcgcggcgcccaggcaccagcaccagcaggcgcgccgcggRRCcaggttcc 

ce(cgctcgrcgtgtgcgccagcgccggcgccatggccgcctccctcccgaccatgc cgccgtccggctacgaccagg(gcgcaacggcgtgccgcgcggccaggt 

gHigaacatctcctacttctccaccgccaccaactccacccgcccggcccgcgtgtacctcccgccgggctactccaaggacaagaagtactccgtgctctacctcctcc 

acggcatcggcggctccgagaacgactggttcgag ggcggc ggccgcgccaacg tgatcgccgacaacctcatcgccgagggcaagatcaagccgctcatcatcgt 

gaccccgaacaccaacgccgccggcccgggcatcgccgacggctacgagaacttcaccaaggacctcctcaactccctcatcccgtacatcgagtccaactactccgt 

gtacaccgaccgcgagcaccgcgccatcgccggcctctctatgggcggcggccagtccttcaacatcggcctcacc aacctcgacaagttcecctacatcggcccgat 

ctccgccgccccgaacacctacccgaacgagcgcctcttcccggacggcggcaaggccgcccgcgagaagctcaagctcctcttcatcgc ctgcggcaccaacgact 

ccctcatcggctlcggccagcgcgtgcacgagtactgcgtggccaacaacatcaaccacgtgtactggctcatccagggc ggcggccacgacttcaacgtgtggaagc 

cgggcctctggaacttcctccagatggccgacgaggccggcctcacccgcgacggcaacaccccggtgccgaccccgtccccgaagccggc caacacccgcatcg 

aggccgaggactacgacggcatcaaclcctcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcggctacatcacctccgg cgactacctcgtg 

(acaagtccatcgacttcggcaacggcgccacctccttcaaggccaaggtggccaacgccaacacctccaacatcgagcttcgcctcaacggcccgaacggcaccctc 

atcggcaccctctccgtgaaetccaccggcgactggaacacctacgaggagcagacctgctccatctccaaggtga ccpgcatcaacgacctctacctcgtgttcaagg 

gcccg gi gaacatcgactggttcaccttcggcgtgtag 

13039 A A Sequence (SEP ID NO: 106) 
mlaalatsqlvatraglgv^dastfrrgaaqglrgarasaaadilsm 
s vfstamstrpan'vlppgvskdJckysvlvllhgiggsendNvfe^ 
hraiaglsmgggqsfhigltnldkfayigpisaapntvpnerln^ 

nfl qmadeagltrdgTitpvT)hpspkpantrieacdvdginsssieiigvT>peggTgigviisgdvlv^ksidfgn 
stgdwntYeeqtcsisk^gindlyivfkgpvnidvvftfgv* 

13347 Sequence (SEP ID NO: 107) 

atgagggtgngctc gttgccctcgctctcctggctctcgctgcgagcgccacctccatggccgcctccctcccgaccatgccgccgtccggctacgaccaggtgcgca 

acggcgtgccgcgcggccaggtggtgaacatctcclacttctccaccgccaccaactccacccgcccggcccgcgtgtacctcccgccgggctactccaaggacaag 

aagtactccgtgctctacctcctccacggcatcggcggctccgagaacgactggttcgagggcggcggccgcgccaacgtgatcgccgacaacctcatcgccgaggg 

caagaicaagccgctcatcatcgtgaccccgaacaccaacgccgccggcccgggcatcgccgacggctacgagaacttcaccaaggacctcctcaactccctcatccc 

gtacatcgagtccaactactccgtgtacaccgaccgcgagcaccgcgccatcgccggcctctctatgggcggcggccagtccttcaacatcggcctcaccaacctcgac 

aagncgcctacatcggcccgatctccgccgccccgaacacctacccgaacgagcgcctcttcccggacggcggcaaggccgcccgcgagaa gctcaagctcctctt 

catcgcctgcggcaccaacgactccctcatcggctlcggccagcgcgtgcacgagtactg cgtggccaacaacatcaaccacgtgtactggctcatccagggcggcgg 

ccacgacncaacgtgtggaagccgggcctctggaacttcctccagatggccgacgaggccggcctcacccgcgacggcaacaccccg gtgccgaccccgtccccg 

aagccggccaacacccgcatcgaggccgaggactacgacggcatcaactcctcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcggctac 

atcacctccggcgactacctcgtgtacaagtccatcgacrtcggcaacggcgccacctccttcaaggccaagglggccaacgccaacacctcca acatcgagcttcgcc 
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tcaacggccc g aacggcaccctcatcggcaccctctccgtgaagtccaccg g cgactggaacacctacgagga&cagacctgctccatctccaagetgaccegcatc 
aacpacctctac ctcgt^ ttcaag|igtcccggtgaacatcgactg|gttcaccttcgg cptgtccgagaaggacgaactctag 

13347 Sequence fSEO ID NO: 108) 

mn'llvala)laIaasatsmaas1ptmppsgvdq\Tng^ 
gkikpliivtpntaaagpgiadgvenftkdllnslipyies^ 

cgtndsligfgqrvhevcvarminhvywliqggghdfnvwkpglvvTiflqmadeagltrdgnm 
tsgdvlvvksidfgngatsfkakvanantsnielrlngpngtligtlsvkstgdwntveeQtcsiskvtgindlvW 



Example 53 

Hvdrolvtic degradation of corn fiber by ferulic acid esterase 
Corn fiber is a major by-product of corn wet and dry milling. The fiber component is 
composed primarily of course fiber arising from the seed pericarp (hull) and aleurone, with a 
smaller fraction of fine fiber coming from the endosperm cell walls. Ferulic acid, a 
hydroxycinnamic acid, is found in high concentrations in the cell walls of cereal grains resulting 
in a cross linking of lignin, hemicellulose and cellulose components of the cell wall. Enzymatic 
degradation of ferulate cross-linking is an important step in the hydrolysis of corn fiber and may 
result in the accessibility of further enzymatic degradation by other hydrolytic enzymes. 

Ferulic Acid Esterase Activity Assay 

Ferulic acid esterase, FAE-1 , ( maize optimised synthetic gene from C. thermocellum) 
was expressed in £. coli. Cells were harvested and stored at -80°C overnight. Harvested 
bacteria was suspended in 50mM Tris buffer pH7.5. Lysozyme was added to a final 
concentration of 200 ug/mL and the sample incubated 10 minutes at room temperature with 
gently shaking. The sample was centrifuged at 4 °C for 1 5 minutes at 4000 rpm. Following 
centrifugation, the supernatant was transferred to a 50 mL conical tube, and placed in 70 degree 
Celsius water bath for 30 minutes. The sample was then centrifuged for 1 5 minutes at 4000 rpm 
and the cleared supernatant transferred to a conical tube ( Blum et al. J Bacteriology, Mar 2000, 
pg 1346-1351.) 
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The recombinant FAE-I was tested for activity using 4-methylumbelliferyl ferulate as 
described in Mastihubova et al (2002) Analytical Biochemistry 309 96-101. Recombinant 
protein FAE-1 (104-3) was diluted 10, 100, and 1000 fold and assayed. Activity assay results 
are shown in Figure 22. 

Preparation of Corn Seed Fiber 

Corn pericarp coarse fiber was isolated by steeping yellow dent #2 kernels for 48hrs at 50 
°C in 2000 ppm sodium metabisulfite( (Aldrich). Kernels were mixed with water in equal parts 
and blended in a Waring laboratory heavy duty blender with the blade in reverse orientation. 
Blender was controlled with a variable autotransformer (Staco Energy) at 50% voltage output for 
2 min. Blended material was washed with tap water over a standard test sieve #7(Fisher 
scientific) to separate coarse fiber from starch fractions. Coarse fiber and embryos were 
separated by floating the fiber way from the embryos with hot tap water in a 4L beaker (Fisher 
scientific). The fiber was then soaked in ethanol prior to drying overnight in a vacuum oven( 
Precision) at 60° C. Com coarse fiber derived form corn kernel pericarp was milled with a 
laboratory mill 3100 fitted with a mill feeder 31 70(Perten instruments) to 0.5mm particle size. 

Corn Fiber Hydrolysis Assay 

Course fiber (CF) was suspended in 50 mM citrate-phosphate buffer, pH 5.2 at 30 mg/ 5 
ml buffer. The CF stock was vortexed and transferred to a 40 ml modular reservoir (Beckman, 
Cat. No. 372790). The solution was mixed well then 100 ul transferred to a 96 well plate 
(Coming Inc., Cat. No.9017, polystyrene, flat bottom). Enzyme was added at 1-10 ul/well and 
the final volume adjusted to 1 10 ul with buffer, CF background controls contained 10 ul of 
buffer only. Plates were sealed with aluminum foil and incubated at 37°C with constant shaking 
for 1 8 hours. The plates were centrifiiged for 1 5 min at 4000 rpm. 1-10 ul of CF supernatant 
was transferred to a 96 well plate preloaded with 100 ul of BCA reagents (BCA-reagents: 
Reagent A (Pierce, ProdJ 23223), Reagent B (Pierce, Prod.# 23224). The final volume was 
adjusted to 1 10 ul. The plate was sealed with aluminum foil and placed at 85°C for 30 min. 
Following incubation at 85°C, the plate was centrifuged for 5 min at 2500 rpm. Absorbance 
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values were read at 562 nm (Molecular Devices, Spectramax Plus). Samples were quantified 
with D-glucose and D-xylose (Sigma) calibration curves. Assay results are reported as total 
sugar released. 

Measurement of total sugar released bv Ferulic Acid Esterase in Corn Seed Fiber Hydrolysis 

Assay 

Results from the recombinant FAE-1 fiber hydrolysis assay showed no increase in total 
reducing sugars (data not shown). These results were not unexpected since it has been reported 
in the literature that an increase in total reducing sugars is detectable only when other hydrolytic 
enzymes are used in combination with the FAE ( Yu et al J. Agric. Food Chem, 2003, 51,218- 
223). Figure 23 shows that addition of FAE-2 to a fungal supernatant which had been grown on 
com fiber, shows and increase in total reducing sugars. This suggests that FAE does play an 
important role in corn fiber hydrolysis. 

Figure 23 shows Corn Fiber Hydrolysis assay results showing increase in release of total 
reducing sugars from corn fiber with addition of FAE-2 to fungal supernatant (FS9). 

Analysis of Ferulic Acid released from corn seed fiber bv FAE-1 

FAE activity on corn fiber was tested by following the release of ferulic acid as described 
in Walfron and Parr (1996) I Waldron. KW. Parr AJ 1996 Vol 7 pages 305-312 Phvtochem 
Anal ) with slight modification. Corn coarse fiber derived from corn kernel pericarp was milled 
with a laboratory mill 3100 fitted with a mill feeder 3170 (Perten instruments) to 0.5mm particle 
size and used as substrate at a concentration of 10 mg/ml. 1 ml assays were conducted in 24 well 
Becton Dickenson Multiwell™. Substrate was incubated in 50 iruM citrate phosphate pH 5.4 at 
50° C at 1 10 rpm for 18 hrs in the presence and absence of recombinant FAE. After the 
incubation period, samples were centrifuged for 10 minutes at 13,000 rpm prior to ethyl acetate 
extraction. All solvents and acids used were from Fisher Scientific. 0.8 ml of supernatant was 
acidified with 0.5 ml acetic glacial acid and extracted three times with equivalent volume of 
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ethyl acetate. Organic fractions were combined and speed vac to dryness (Savant) at 40° C. 
Samples were then suspended with \Q0\x\ of methanol and used for HPLC analysis. 

HPLC chromatography was carried out as follows. Ferulic acid (ICN Biomedicals) was 
used as standard in HPLC analysis (data not shown). HPLC analysis was conducted with a 
Hewlett Packard series 1 100 HPLC system. The procedure employed a C 18 folly capped reverse 
phase column (XterraRpis, 150mm X 3.9mm i.d. 5fim particle size) operated in 1.0 ml min at 
40°C. Ferulic Acid was eluted with a gradient of 25 to 70 % B in 32 min (solvent A: H20, 
0.01 %b TFA; solvent B: MeCN, 0.0075%). 

As shown in Figure 24, FA released from corn fiber was 2-3 fold higher than control 
when treated with 10 or 100 ul of FAE-1 . These results clearly show that FAE-1 is capable of 

hydrolyzing corn fiber. 

Example 54 

Functionality in fermentation of maize expressed glucoamvlase and amylase 

This example demonstrates that maize-expressed enzymes will support fermentation of starch in 
a corn slurry in the absence of added enzyme and without cooking the corn slurry. Maize kernels 
that contain Rhizopus ozyzae glucoamylase (ROGA) (SEQ ID NO: 49) were produced as 
described in Example 32. Maize kernels that contain the barley low-pi oc-amylase (AMYI) (SEQ 
ID NO: 88) are produced as described in Example 46. The following materials are used in this 
example: 

Aspergillus niger glucoamylase (ANGA)was purchased from Sigma. 

Rhizopus species glucoamylase (RxGA) was purchased from Wako as a dry crystalline 

powder and made up in 10 mM NaAcetate pH 5.2, 5 mM CaCb. at 10 mg/ml. 
MAMYI Microbially produced AMYI was prepared at approximately 0.25 mg/ml in 10 

mM NaAcetate pH 5.2, 5 mM CaCI 2 . 
Yeast was Saccharomyces cereviceae 
YE was a sterile 5% solution of yeast extract in water 
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Yeast starter contained 50 g maltodextrin, 1.5 g yeast extract, 0.2 mg ZnS0 4 in a total 
volume of 300 ml of water, the medium was sterilized by autoclaving after 
preparation. After cooling to room temperature, 1 ml of tetracycline (10 mg/ml 
in ethanol), 100 |il AMG300 glucoamylase and 155 mg active dry yeast, were 
added. The mixture was then shaken at 30 °C for 22 h. The overnight yeast 
culture was diluted 1/10 with water and A600 measured to determine the yeast 
number, as described in Current Protocols in Molecular Biology. 

ROGA flour Kernels were pooled from several TO lines shown to have active 

glucoamylase The seeds were ground in the Kleco, and all flour was pooled , 

AMY I flour Kernels from TO corn expressing AMYI were pooled and ground as above. 

Control flour Kernels from with similar genetic background were ground in the same 
fashion as the ROGA expressing corn 

An inoculation mixture was prepared in a sterile tube; it contained per 1.65 ml: yeast cells (lx 
1 0 7 ), yeast extract (8.6 mg), tetracycline (55 jig). 1 ,65 ml was added / g flour to each 
fermentation tube. 

Fermentation preparation: Flour was weighed out at 1.8 g / tube into tared 17 x 100 mm 
sterile polypropylene. 50 ^1 of 0.9 M H 2 S0 4 was added to bring the final pH prior to 
fermentation to 5. The inoculation mixture (2.1 ml) was added / tube, along with 
RXGA, AMYJ-P and amylase desalting buffer as indicated below. The quantity of 
buffer was adjusted based on moisture content of each flour so that the total solids 
content was constant in each tube. The tubes were mixed throroughly, weighed and 
placed into a plastic bag and incubated at 30 °C. 



Table 21 





Flours 


Innoculation 


Microbial enzymes 


Amylase 
desalting 

Buffer 


Tube 


Control 


ROGA 


AMYI 


Mix 


RXGA 


AMYI-P 






9 


9 


ml 


ml 


ml 


ml 


A 


1.8 






2.1 


0 


0 
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o 
D 


i .0 






2 1 
c. 1 


0 ens 


o 


1 

1 


C 


1.8 






2.1 


0.036 


1 


0 


D 


1.8 






2.1 


0 


1 


0.036 


E 


1.6 




0.2 


2.1 


0.036 


0 


1 


F 


0.2 


1.6 




2.1 






1 


G 


0.2 


1.6 




2.1 


0 


1 


0 


H 


0 


1.6 


0.2 


2.1 




0 


1 



The fermentation tubes were weighed at intervals over the 67 h time course. Loss of weight 
corresponds to evolution of C0 2 during fermentation. The ethanol content of the samples was 
determined after 67 h of fermentation by the DCL ethanol assay method. The kit (catalogue # 

* 

229-29) was purchased from Diagnostic Chemicals Limited, Charlottetown, PE, Canada, DIE 
1B0. Samples (10 (il) were drawn in triplicate from each fermentation tube and diluted into 990 
of water. 10 pi of the diluted samples were mixed with 1.25 ml of a 12.5/1 mixture of assay 
buffer / ADH-NAD reagent. Standards (0, 5, 10, 15 & 20% v/v ETOH) were diluted and 
assayed in parallel. Reactions were incubated at 37 °C for 10 min, then A340 read. Standards 
were prepared in duplicate, samples from each fermentation were prepared in triplicate 
(including the initial dilution). The weight of the samples changed with time as detailed in table 
below. The weight loss is expressed as a percentage of the initial sample weight at time 0. 



Table 22 







Time (h) 






0 


18 


24 I 42 I 48 I 67 


Sample 


Flour Composition 


% wgt loss 


A 


Control 


0.00 


8.09 


9.38 


12.96 


13.83 


16.85 


B 


Control + RXGA 


0.00 


11.48 


14.20 


21.79 


23.83 


24.63 


C 


Control ♦ RXGA + MAMYI 


0.00 


17.90 


23.27 


36.48 


39.07 


47.59 


D 


Control + MAMYI 


0.00 


13.70 


17.72 


28.27 


30.80 


38,27 


E 


Control +RXGA + AMY! flour 


0.00 


16.85 


21.60 


33.95 


36.98 


45.74 


F 


ROGA flour 


0.00 


9.81 


11.74 


16.96 


18.39 


23.17 


G 


ROGA flour + MAMYI 


0.00 


15.53 


19.69 


29.75 


32.11 


39.94 


H 


ROGA flour ♦ AMYI flour 


0.00 


13.35 


16.27 


23.60 


25.53 


31.68 



These data show that the ROGA enzyme expressed in maize increases fermentation rate as 
compared to the no-enzyme control. It also confirms previous data indicating that the AMYI 
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enzyme expressed in maize kernels is a potent activator of fermentation of the starch in corn. 
The ethanol contents are detailed below. 



Table 23 





Flour 


ETOH 


Standard 


Sample 


Composition 


% v/v 


deviation 


A 


Control 


2.09 


0.08 


B 


Control + RXGA 


7.97 


0.18 


C 


Control + RXGA + MAMYI 


13.47 


0.27 


D 


Control + MAMYI 


11.26 


0.12 


E 


Control +RXGA + AMY! flour 


12.28 


0.08 


F 


ROGA flour 


3.55 


0.05 


G 


ROGA flour + MAMYI 


11.29 


0.18 


H 


ROGA flour + AMYI flour 


8.58 


0.13 



These data also demonstrate that expressing Rhizopus oryzae glucoamylase in maize 
facilitates increased fermentation of the starch in corn. Similarly, expression of the barley 
amylase in maize makes corn starch more fermentable with out adding exogenous enzymes. 



Example 55 

Cellobiohvdrolase I 

The Thchoderma reesei cellobiohydrolase I (CBH I) gene was amplified and cloned by 
RT-PCR based on a published database sequence (accession U E00389). The cDNA sequence 
was analyzed for the presence of a signal sequence using the SignalP program, which predicted a 
17 amino acid signal sequence. The DNA sequence encoding the signal sequence was replaced 
with an ATG by PCR, as shown in the sequence (SEQ ID NO: 79). This cDNA sequence was 
used to make subsequent constructs. Additional constructs are made by substituting a maize 
optimised version of the gene (SEQ ID NO: 93). 



Example 56 

Cellobiohvdrolase II 

The Trichoderma reesei cellobiohydrolase II (CBH II) gene was amplified and cloned by 

RT-PCR based on a published database sequence (accession # M55080). The cDNA sequence 

was analyzed for the presence of a signal sequence using the SignalP program, which predicted 
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an 18 amino acid signal sequence. The DNA sequence encoding the signal sequence was 
replaced with an ATG by PCR, as shown in the sequence (SEQ ID NO: 81). This cDNA 
sequence was used to make subsequent constructs. Additional constructs are made by 
substituting a maize optimised version (SEQ ID NO: 94) of the gene. 

Example 57 

Construction of transformation vectors for the Trichoderma reesii cellobiohvdrolase I and 

cellobiohvdrolase II 

Cloning of the Trichoderma reesii cellobiohydrolase I (cWn)cDNA without the native N- 
terminal signal sequence is described in Example 55. Expression cassettes were constructed to 
express the Trichoderma reesii cellobiohydrolase I cDNA in maize endosperm with various 
targeting signals as follows: 

Plasmid 12392 comprises the Trichoderma reesii cbhi cDNA cloned behind the y zein 
promoter for expression specifically in the endosperm for expression in the cytoplasm. 

Plasmid 1 2391 comprises the maize y-zein N-terminal signal sequence 
(MRVLLV ALALLALAAS ATS)(SEQ ID NO: 1 7) fused to Trichoderma reesii cbhi cDNA as 
described above in Example 1 for targeting to the endoplasmic reticulum and secretion into the 
apoplast (Torrent et al. 1 997). The fusion was cloned behind the y zein promoter for expression 
specifically in the endosperm. 

Plasmidl2392 comprises the y-zein N-terminal signal sequence fused to the Trichoderma 
reesii cbhi cDNA with a C-terminal addition of the sequence KDEL for targeting to and 
retention in the endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned 
behind the maize y zein promoter for expression specifically in the endosperm. 

Plasmid 12656 comprises the waxy amyloplast targeting peptide (Klosgen et al., 1986) 
fused to the Trichoderma reesii cbhi cDNA for targeting to the amyloplast. The fusion was 
cloned behind the maize y zein promoter for expression specifically in the endosperm. 

All expression cassettes were moved into a binary vector (pNOV21 17) for transformation 
into maize via Agrobacterium infection. The binary vector contained the phosphomannose 
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isomerase (PMI) gene which allows for selection of transgenic cells with mannose. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 

Additional constructs (plasmids 12652,12653,12654 and 12655) were made with the 
targeting signals described above fused to Trichoderma reesii cellobiohydrolasell (cbhii) cDNA 
in precisely the same manner as described for the Trichoderma reesii cbhi cDNA. These fusions 
were cloned behind the maize Q protein promoter (50Kd y zein) (SEQ CD NO: 98) for expression 
specifically in the endosperm and transformed into maize as described above. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 

Combinations of the enzymes can be produced either by crossing plants expressing the 
individual enzymes or by cloning several expression cassettes into the same binary vector to 
enable co-transformation. 

Example 58 

Expression of a Cbhi in corn 

TI seed from self-pollinated maize plants transformed with either plasmid 12390, 12391 
or 12392 was obtained. The 12390 construct targets the expression of the Cbhi in the 
endoplasmic reticulum of the endosperm, the 12391 construct targets the expression of the Cbhi 
in the apoplast of the endosperm and the 12392 construct targets the expression of the Cbhi in 
the cytoplasm of the endosperm. 

Extraction and detection of the Cbhi from corn-flour : Polyclonal antibodies to Cbhi and 
CbhTI were produced in goat according to established protocols. Flour from the Cbhi transgenic 
seeds was obtained by grinding them in an Autogizer grinder. Approximately 50 mg of flour 
was resuspended in 0.5ml of 20mM NaP0 4 buffer (pH 7.4),150mM NaCl followed by 
incubation for 15 minutes at RT with continuous shaking. The incubated mixture was then spun 
for lOmin. at 10,000xg. The supernatant was used as enzyme source. 30 \i\ of this extract was 
loaded on a 4-12 % NuPAGE gel (invitrogen) and separated in the NuPAGE MES running 
buffer (invitrogen). Protein was blotted onto nitrocellulose membranes and Western blot 
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analysis was done following established protocols using the specific antibodies described above 
followed by alkaline phosphatase conjugated rabbit antigoat IgG (H+L) . Alkaline phosphatase 
activity was detected by incubation of the membranes with ready to use BCIP/MBT (plus) 
substrate from Moss Inc. 

Western Blot analysis was done of Tl seeds from different events transformed with 
plasmid 1 2390. Expression of Cbhl protein was compared to the non-transgenic control, and 
was detected in a number of events. 

The Cracked Com Assay was performed essentially as described in Example 49, using 
transgenic seed expressing Cbhi. Starch recovery from the transgenic seed was measured and 
the results are set forth in Table 24. 

Table 24. 





Line 3-non expressing control Line 4 


- CBHI expressing 


Conditions 




Starch (mg) 




400ppm S02-No Bromelain 


40.2 




78.1 


400ppmSO2-Plus Bromelain 


48.1 




118.7 


2000ppm S02-No Bromelain 


47.5 




73.1 


2000ppmSO2-Plus Bromelain 


49.2 




109 



Example 59 

Preparation of Endoglucanase I Constructs 

A Trichoderma reesei endoglucanase 1 (EGLI) gene was amplified and cloned by PCR 
based on a published database sequence (Accession U Ml 5665; Penttila et al., 1986). Because 
only genomic sequences could be obtained, the cDNA was generated from the genomic sequence 
by removing 2 introns using Overlap PCR. The resulting cDNA sequence was analyzed for the 
presence of a signal sequence using the SignalP program, which predicted a 22 amino acid signal 
sequence. The DNA sequence encoding the signal sequence was replaced with an ATG by PCR, 
as shown in the sequence (SEQ ED NO: 83). This cDNA sequence was used to make subsequent 
constructs as set forth below. 
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Overlap PCR 

Overlap PCR is a technique (Ho et al., 1989) used to fuse complementary ends of two or 
more PCR products, and can be used to make base pair (bp) changes, add bp, or delete bp. At 
the site of the intended bp change, forward and reverse mutagenic primers (Mut-F and Mut-R) 
are made that contain the intended change and 15 bp of sequence on either side of the change. 
For example, to remove an intron, the primers would consist of the final 15 bp of exon 1 ftised to 
the first 1 5 bp of exon 2. Primers are also prepared that anneal to the ends of the sequence to be 
amplified, e.g ATG and STOP codon primers. PCR amplification of the products proceeds with 
the ATG/Mut-R primer pair and the Mut-F/STOP primer pair in independent reactions. The 
products are gel purified and flised together in a PCR without added primers. The fusion 
reaction is separated on a gel, and the band of the correct size is gel purified and cloned. 
Multiple changes can be accomplished simultaneously through the addition of additional 
mutagenic primer pairs. 

EGLI Plant Expression Constructs 
Expression cassettes were made to express the Trichoderma reesei EGLI cDNA in maize 
endosperm as follows: 

13025 comprises the T. reesei EGLI gene cloned behind the maize y-zein promoter for 
cytoplasmic localization and expression specifically in the endosperm. 

13026 comprises the maize y-zein N-terminal signal peptide (MRVLLVALALLALAASATS) 
fused to the T. reesei EGLI gene for targeting to the endoplasmic reticulum and secretion into the 
apoplast. The fusion was cloned behind the maize y-zein promoter for expression specifically in 
the endosperm. 

13027 comprises the maize y-zein N-terminal signal peptide fused to the T. reesei EGLI gene 
with a C-terminal addition of the sequence KDEL for targeting to and retention in the 
endoplasmic reticulum. The fusion was cloned behind the maize y-zein promoter for expression 
specifically in the endosperm. 
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13028 comprises the maize Granule Bound Starch Synthase I (GBSSI) N-terminal signal peptide 
(N-terminal 77 amino acids) fused to the T. reesei EGLI gene for targeting to the lumen of the 
amyloplast. The fusion was cloned behind the maize y-zein promoter for expression specifically 
in the endosperm. 

13029 comprises the maize GBSSI N-terminal signal peptide fused to the T. reesei EGLI gene 
with a C-terminal addition of the starch binding domain (C-terminal 301 amino acids) of the 
maize GBSSI gene for targeting to the starch granule. The fusion was cloned behind the maize 
y-zein promoter for expression specifically in the endosperm. 

Additional Expression cassettes are generated using a maize optimised version of EGLI 
(SEQIDNO:95) 

EGLI Enzyme Assays 

EGLI enzyme activity is measured in maize transgenics using the Malt Beta-Glucanase 
Assay Kit (Cat U K-MBGL) (Megazyme International Ireland Ltd.) The enzymatic activity of 
EGL I expressors is tested in the Corn Fiber Hydrolysis Assay as described in Example 53. 
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Example 60 

ft-Glucosidase 2 

A Trichoderma reesei /3-Glucosidase 2 (BGL2) gene was amplified and cloned by RT- 
PCR based on sequence Accession U AB003 1 1 0 (Takashima et al., 1999). 

BGL2 Plant Expression Constructs 

Expression cassettes were made to express the Trichoderma reesei BGL2 cDNA (SEQ 
ID NO: 89) in maize endosperm as follows: 

13030 comprises the T. reesei BGL2 gene cloned behind the maize 7-zein promoter for 
cytoplasmic localization and expression specifically in the endosperm. 

13031 comprises the maize 7-zein N-terminal signal peptide (MRVLLVALALLALAASATS) 
fused to the T. reesei BGL2 gene for targeting to the endoplasmic reticulum and secretion into 
the apoplast. The fusion was cloned behind the maize 7-zein promoter for expression 
specifically in the endosperm. 

13032 comprises the maize 7-zein N-terminal signal peptide fused to the T. reesei BGL2 gene 
with a C-terminal addition of the sequence KDEL for targeting to and retention in the 
endoplasmic reticulum. The fusion was cloned behind the maize 7-zein promoter for expression 
specifically in the endosperm. 

13033 comprises the maize Granule Bound Starch Synthase I (GBSSI) N-terminal signal peptide 
(N-terminal 77 amino acids) fused to the T. reesei BGL2 gene for targeting to the lumen of the 
amyloplast. The fusion was cloned behind the maize 7-zein promoter for expression specifically 
in the endosperm. 

13034 comprises the maize GBSSI N-terminal signal peptide fused to the T. reesei BGL2 gene 

with a C-terminal addition of the starch binding domain (C-terminal 301 amino acids) of the 

159 



WO 2005/096804 



PCT/US2004/007182 



maize GBSSI gene for targeting to the starch granule. The fusion was cloned behind the maize 
7-zein promoter for expression specifically in the endosperm. 

Additional Expression cassettes are generated by substituting a maize optimized version 

of BGL2 (SEQ ID NO: 96). 

All expression cassettes are inserted into the binary vector pNOV21 17 for transformation 
into maize via Agrobacterium infection. The binary vector contained the phosphomannose 
isomerase (PMI) gene which allows for selection of transgenic cells with mannose. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 

BGL2 Enzyme Assays 

BGL2 enzyme activity is measured in transgenic maize using a protocol modified from 
Bauer and Kelly (Bauer, M.W. and Kelly, R.M. 1998. The family 1 0-glucosidases from 
Pyrococcus furiosus and Agrobacterium faecalis share a common catalytic mechanism. 
Biochemistry 37: 171 70-17178). The protocol can be modified to incubate samples at 37°C 
instead of 100°C. The enzymatic activity of BGL2-expressors is tested in the Fiber Hydrolysis 
Assay. 
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Example 61 

fl-Glucosidase D 



The Trichoderma reesei /3-Glucosidase D (CEL3D) gene was amplified and cloned by 
PCR based on a published database sequence (accession # AY281378; Foreman et al,, 2003). 
Because only genomic sequences could be obtained, the cDNA was generated from the genomic 
sequence by removing an intron using Overlap PCR, as described in Example 58. The resulting 
cDNA (SEQ ID NO: 91) may be used for subsequent constructs. A maize optimised version 
(SEQ TD NO: 97) of the resulting cDNA may also be used for constructs. 

Plant constructs can be generated and j3-glucosidase assays can be performed as described 
for BGL2 in Example 60, replacing BGL2 with CEL3D. 
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Example 62 

Lipases 

cDNAs encoding lipases are generated using sequences from Accession U D85895, 
AF04488, and AF04489 (Tsuchiya et al., 1996; Yu et al., 2003) and methodology set forth in 
Examples 59-60. 

Lipase enzyme activity can be measured in transgenic maize using the Fluorescent Lipase 
Assay Kit (Cat U M0612)(Marker Gene Technologies, Inc.). Lipase activity can also be 
measured in vivo using the fluorescent substrate l,2-dioleoyW-(pyren-l-yl)decanoy)-rac 
glycerol (M0258), also from Marker Gene Technologies, Inc. 

Example 63 

Expression of Phvtase in Rice 

Vectors 1 1 267 and 1 1 268 comprise binary vectors that encode Nov9x phytase. 
Expression of the Nov9x phytase gene in both vectors is under the control of the rice glutelin-1 
promoter (SEQ ID NO:67). Vectors 1 1267 and 1 1268 are derived from pNOV21 17. 

The Nov9x phytase expression cassette in vector 1 1267 comprises the rice glutelin-1 
promoter, the Nov9x phytase gene with apoplast targeting signal, a PEPC intron, and the 35S 
terminator. The product of the Nov9x phytase coding sequence in vector 1 1267 is shown in SEQ 
ID NO: 110 . 

The Nov9x phytase expression cassette in vector 1 1268 comprises the rice glutelin-1 
promoter, the Nov9x phytase gene with ER retention (SEQ ID NO:l 1 1), a PEPC intron, and the 
35S terminator. The product of the Nov9x phytase coding sequence in vector 1 1268 is shown in 
SEQ ED NO: 112. 
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1 1267 Nov9x phytase with apoplast targeting DNA sequence (SEQ ID NO: 109). 
Translation start and stop codons are underlined. The sequence encoding the signal 

sequence of the 27-kD gamma-zein protein is in bold. 

atgagggtgttgctcgttgccctcgctctcctggctctcgctgcgagcgccaccagcgctgcgcagtccgagccggagctgaagctgg 

agtccgtggtgatcgtgtcccgccacggcgtgcgcgccccgaccaaggccacccagctcatgcaggacgtgaccccggacgcctggcc 

gacctggccggtgaagctcggcgagctgaccccgcgcggcggcgagctgatcgcctacctcggccactactggcgccagcgcctcgtg 

gccgacggcctcctcccgaagtgcggctgcccgcagtccggccaggtggccatcatcgccgacgtggacgagcgcacccgcaagacc 

ggcgaggccttcgccgccggcctcgccccggactgcgccatcaccgtgcacacccaggccgacacctcctccccggacccgctcttcaa 

cccgctcaagaccggcgtgtgccagctcgacaacgccaacgtgaccgacgccatcctggagcgcgccggcggctccatcgccgacttc 

accggccactaccagaccgccttccgcgagctggagcgcgtgctcaacttcccgcagtccaacctctgcctcaagcgcgagaagcagga 

cgagtcctgctccctcacccaggccctcccgtccgagctgaaggtgtccgccgactgcgtgtccctcaccggcgccgtgtccctcgcctcc 

atgctcaccgaaatcttcctcctccagcaggcccagggcatgccggagccgggctggggccgcatcaccgactcccaccagtggaacac 

cctcctctccctccacaacgcccagttcgacctcctccagcgcaccccggaggtggcccgctcccgcgccaccccgctcctcgacctcatc 

aagaccgccctcaccccgcacccgccgcagaagcaggcctacggcgtgaccctcccgacctccgtgctcttcatcgccggccacgacac 

caacctcgccaacctcggcggcgccctggagctgaactggaccctcccgggccagccggacaacaccccgccgggcggcgagctggt 

gttcgagcgctggcgccgcctctccgacaactcccagtggancaggtgtccctcgtgttccagaccctccagcagatgcgcgacaagacc 

ccgctctccctcaacaccccgccgggcgaggtgaagctcaccctcgccggctgcgaggagcgcaacgcccagggcatgtgctccctcg 

ccggcttcacccagatcgtgaacgaggcccgcatcccggcctgctccctctaa 



11267 Nov9x phytase with apoplast targeting gene product (SEQ ID NO:110). The signal 

sequence of the 27-kD gamma-zein protein is in bold. 

mrvllvalallalaasatsaaqsepelkleswivsrhgvrapto 

dgllpkcgcpqsgqvaiiadvdertrktgeafaaglapdcaitvhtqadtsspdplfo^ 

qtafrelervlnfpqsnlcllaekqdescsltqalpselkvsadcvsltgavslasmlteifllqqaqgmpepgwgri 

aqfdllqrtpevarsratplldliktaltphppqkqaygvtlptsvlfiaghdtnlanlggalelnwtlpgqpdntppgge 

sqwiqvslvfqtlqqmrdktplslntppgevkltlagceemaqgmcslagflqivnearipacsl 

11268 Nov9x phytase with ER retention DNA sequence (SEQ ID NO:lll). The sequence 
encoding the signal sequence of the 27-kD gamma-zein protein is in bold. The sequence 

encoding the SEKDEL hexapeptide ER retention signal is underlined. 

atgagggtgttgctcgttgccctcgctctcctggctctcgctgcgagcgccaccagcgctgcgcagtccgagccggagctgaagctgg 

agtccgtggtgatcgtgtcccgccacggcgtgcgcgccccgaccaaggccacccagctcatgcaggacgtgaccccggacgcctggcc 

gacctggccggtgaagctcggcgagctgaccccgcgcggcggcgagctgatcgcctacctcggccactactggcgccagcgcctcgtg 

gccgacggcctcctcccgaagtgcggctgcccgcagtccggccaggtggccatcatcgccgacgtggacgagcgcacccgcaagacc 

ggcgaggccttcgccgccggcctcgccccggactgcgccatcaccgtgcacacccaggccgacacctcctccccggacccgctcttcaa 

cccgctcaagaccggcgtgtgccagctcgacaacgccaacgtgaccgacgccatcctggagcgcgccggcggctccatcgccgacttc 

accggccactaccagaccgccttccgcgagctggagcgcgtgctcaacttcccgcagtccaacctctgcctcaagcgcgagaagcagga 

cgagtcctgctccctcacccaggccctcccgtccgagctgaaggtgtccgccgactgcgtgtccctcaccggcgccgtgtccctcgcctcc 

atgctcaccgaaatcttcctcctccagcaggcccagggcatgccggagccgggctggggccgcatcaccgactcccaccagtggaacac 

cctcctctccctccacaacgcccagttcgacctcctccagcgcaccccggaggtggcccgctcccgcgccaccccgctcctcgacctcatc 

aagaccgccctcaccccgcacccgccgcagaagcaggcctacggcgtgaccctcccgacctccgtgctcrtcatcgccggccacgacac 

caacctcgccaacctcggcggcgccctggagctgaactggaccctcccgggccagccggacaacaccccgccgggcggcgagctggt 

gttcgagcgctggcgccgcctctccgacaactcccagtggattcaggtgtccctcgtgttccagaccctccagcagatgcgcgacaagacc 
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ccgctctccctcaacaccccgccgggcgaggtgaagctcaccctcgccggctgcgaggagcgcaacgcccagggcatgtgctccctcg 
cr.Pgcttrnr.rxagatcgtgaacgaggcccgcatcccggcctgctccctc tccpagaagpacgagctg taa 

1 1268 Nov9x phytase with ER retention, gene product (SEQ ID NO: 112). The signal 
sequence of the 27-kD gamma-zein protein is in bold. The ER retention signal is 

underlined. 

mrvllvalallalaasatsaaqsepelkleswivsrhgv^ 
dgllpkcgcpqsgqvaiiadvdertrktgeafaaglapdcaitvh^ 
qtafrelervlnfpqsnlclkrekqdescsltqalpse 
aqfdliqrtpevarsratplldliktaltphppqkqaygvtlptsvlfi^ 

sqwiqvslvfqtlqqmrdktplslntppgevkltlagceemaqgrncslagftqivnearipacslsek^ 

Generation of transgenic rice plants 

Rice (Oryza saliva) is used for generating transgenic plants. Various rice cultivars can be 
used (Hiei et al., 1994, Plant Journal 6:271-282; Dong et al., 1996, Molecular Breeding 2:267- 
276; Hiei et al., 1997, Plant Molecular Biology, 35:205-218). Also, the various media 
constituents described below may be either varied in concentration or substituted. Embryogenic 
responses are initiated and/or cultures are established from mature embryos by culturing on MS- 
CIM medium (MS basal salts, 4.3 g/liter; B5 vitamins (200 x), 5 ml/liter; Sucrose, 30 g/liter; 
proline, 500 mg/Iiter; glutamine, 500 mg/liter; casein hydrolysate, 300 mg/liter; 2,4-D (1 mg/ml), 
2 ml/liter; adjust pH to 5.8 with 1 N KOH; Phytagel, 3 g/liter). Either mature embryos at the 
initial stages of culture response or established culture lines are inoculated and co-cultivated with 
the Agrobacterium strain LBA4404 containing the desired vector construction. Agrobacterium is 
cultured from glycerol stocks on solid YPC medium (100 mg/L spectinomycin and any other 
appropriate antibiotic) for -2 days at 28 °C. Agrobacterium is re-suspended in liquid MS-CIM 
medium. The Agrobacterium culture is diluted to an OD600 of 0.2-0.3 and acetosyringone is 
added to a final concentration of 200 uM. Agrobacterium is induced with acetosyringone before 
mixing the solution with the rice cultures. For inoculation, the cultures are immersed in the 
bacterial suspension. The liquid bacterial suspension is removed and the inoculated cultures are 
placed on co-cultivation medium and incubated at 22°C for two days. The cultures are then 
transferred to MS-CEM medium with Ticarcillin (400 mg/liter) to inhibit the growth of 
Agrobacterium. For constructs utilizing the PMI selectable marker gene (Reed et al., In Vitro 
Cell. Dev. Biol. -Plant 37:127-132), cultures are transferred to selection medium containing 
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Mannose as a carbohydrate source (MS with 2%Mannose, 300 mg/liter Ticarcillin) after 7 days, 
and cultured for 3-4 weeks in the dark. Resistant colonies are then transferred to regeneration 
induction medium (MS with no 2,4-D, 0.5 mg/liter IAA, 1 mg/liter zeatin, 200 mg/liter 
Ticarcillin 2% Mannose and 3% Sorbitol) and grown in the dark for 14 days. Proliferating 
colonies are then transferred to another round of regeneration induction media and moved to the 
light growth room. Regenerated shoots are transferred to GA7-1 medium (MS with no hormones 
and 2% Sorbitol) for 2 weeks and then moved to the greenhouse when they are large enough and 
have adequate roots. Plants are transplanted to soil in the greenhouse and grown to maturity. 

Example 64 

Analysis of Transgenic Rice Seed Expressing Nov9X Phvtase 

ELISA For The Quantitation Of Nov9X Phytase From Rice Seed 

Quantitation of phytase expressed in transgenic rice seed was assayed by ELISA. One 
(lg) rice seed was ground to flour in a Kleco seed grinder. 50 mg of flour was resuspended in 
the sodium acetate buffer described in example - for Nov9X phytase activity assay and diluted 
as required for the immunoassay. The Nov9X immunoassay is a quantitative sandwich assay for 
the detection of phytase that employs two polyclonal antibodies. The rabbit antibody was 
purified using protein A, and the goat antibody was immunoaffinity purified against recombinant 
phytase (No v9X) protein produced in E.coli inclusion bodies. Using these highly specific 
antibodies, the assay can measure picogram levels of phytase in transgenic plants. There are three 
basic parts to the assay. The phytase protein in the sample is captured onto the solid phase 
microtiter well using the rabbit antibody. Then a "sandwich" is formed between the solid phase 
antibody, the phytase protein, and the secondary antibody that has been added to the well. After 
a wash step, where unbound secondary antibody has been removed, the bound antibody is 
detected using an alkaline phosphatase-labeled antibody. Substrate for the enzyme is added and 
color development is measured by reading the absorbance of each well. The standard curve uses 
a four-parameter curve fit to plot the concentrations versus the absorbance, 

Phytase activity assay 
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Determination of phytase activity, based upon the estimation of inorganic phosphate 
released on hydrolysis of phytic acid, can be performed at 37°C following the method of 
Engelen, A.J. et al., J. AOAC Inter., 84 , 629 (2001). One unit of enzyme activity is defined as 
the amount of enzyme that liberates 1 fimol of inorganic phosphate per minute under assay 
conditions. For example, phytase activity may be measured by incubating 2.0 ml of the enzyme 
preparation with 4.0 ml of 9.1 mM sodium phytate in 250 mM sodium acetate buffer pH 5.5, 
supplemented with 1 mM CaC12 for 60 minutes at 37°C. After incubation, the reaction is 
stopped by adding 4.0 ml of a color-stop reagent consisting of equal parts of a 10% (w/v) 
ammonium molybdate and a 0.235% (w/v) ammonium vanadate stock solution. Precipitate is 
removed by centrifugation, and phosphate released is measured against a set of phosphate 
standards spectrophotometrically at 415 nm. Phytase activity is calculated by interpolating the 
A41 5 nm absorbance values obtained for phytase containing samples using the generated 
phosphate standard curve. 

This procedure may be scaled down to accommodate smaller volumes and adapted to 
preferred containers. Preferred containers include glass test tubes and plastic microplates. 
Partial submersion of the reaction vessel(s) in a water bath is essential to maintain constant 
temperature during the enzyme reaction. 



Table 24 



Trans-genic 
line 


Mg 
phytase/g 

flour* 


Phytase activity 
units per g flour** 


Endogenous inorganic 
phosphate released by 
cooking of dehusked rice 
seed (limol/gseed) 


Endogenous inorganic 
phosphate released by cooking 
of dehusked, polished rice 
seed (umol/gseed) 


Wild type 


0 


0 


1.442 


0.469 


1 


510 


916 


1.934 


0.840 


2 


1518 


2800 


2.894 


1.073 



*Mg phytase was assayed by a sandwich ELISA 

** Phytase activity was assayed by Phytase activity assay as described above. 



Assay of Inorganic Phosphate Release During Cooking of Transgenic Rice Expressing Phytase 
Two samples of Ig seed from selected rice transgenic lines and a control wildtype line 
was dehusked using a benchtop Kett TR200 automatic rice husker . One sample was then 
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polished for 30 seconds in a Kett Rice polisher. Two volumes of H20 was added to each 
sample and the rice was cooked by immersing the tubes into a beaker of water. The water was 
brought to a boil and held in a full rolling boil for 10 minutes. The "cooked" rice seed was then 
ground to a paste with water bringing the total volume of te slurry to 6 ml. The slurry was 
centriftiged at 15,000xg for 10 minutes and the clear supernatant assayed for released 
endogenous inorganic phosphate, The assay of released phosphate is based on color formation as 
a result of molybdate and vanadate ions complexing with inorganic phosphate and is measured 
spectrophotometrically at 4 1 5nm as described in example - for phytase enzymatic activity. The 
results are in Table 24. 

All publications, patents and patent applications are incorporated herein by 
reference. While in the foregoing specification this invention has been described in relation to 
certain preferred embodiments thereof, and many details have been set forth for purposes of 
illustration, it will be apparent to those skilled in the art that the invention is susceptible to 
additional embodiments and that certain of the details described herein may be varied 
considerably without departing from the basic principles of the invention. 

SEQUENCE LISTING 
<110> Lanahan, Mike 

<120> Self -processing Plants and Plant Parts 
<130> 109846.317 

<140> US 60/315, 281 
<141> 2001-08-27 

<160> 60 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 436 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 1 
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1 CC 
13 3 










160 

1 U u 


ion 


lj y o 


Ser 

w C X 


Trp 


Asp 


Gin 


Tvr 

j. y x 


Tro 




1 ' V 










1 7C 

X f 3 




Ala 

r\ 1 a 


Ala 


Tvr 
i y x 


Leu 


Arg 


Ser 


He 


Glv 

\JX / 


10 3 










190 






Va 1 
val 


i-jy o 


nl v 

ui y 


Tyr Gly Ala 




Val 

v a x 










205 








u i y 


vj x y 


1 x p 


Ala 


Val 


Gly 


nlu 


Tvr 
i yx 








220 










Lieu 


Asn 


Trip 

1 X u> 


Ala 


Tyr 


Ser 


Ser 


Glv 

wx j 






? 2 5 
£ j j 










240 


Tvr 
i y x 


Tvr 
a y t 


Lvs 


Met Asp Ala 


Ala 


Phe 




Ten 
C 3 v 










255 




Va 1 
v a x 


Glu 


Ala 


Leu 


Lys 


Asn 


Glv 

vjx y 


Glv 

\jxy 












270 






uy o 


Ala 

n x ci 


Val 

v n x 


Thr 


Phe 


Val 


Ala 


Asn 










285 








uy o 


lyi 


Prn 

i X w 


Ala 


Tyr 


Ala 


Phe 


He 








300 










He 


Phe 


Tvr 


Arg 


Asp 


Tyr 


Glu 


Glu 






J X 3 










320 

W 4> V 


As n 


Leu 

W4 


He 

* ^ 


Trp 


He 


His 


Asp 


Asn 














335 
-j ^ 




Val 


Tvr 

4 y x 


Tvr 


Asp 


Ser 


Asp 


Glu 


Met 


34 5 










350 






Ser 


LVS 


Pro 


Gly 


Leu 


He 


Thr 


Tyr 










365 








Gly 


Arg 


Trp 


Val 


Tyr 


val 


Pro 


Lys 








380 










Tyr 


Thr 


Gly 


Asn 


Leu Gly 


Gly 


Trp 






395 










400 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 




410 










415 




Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 


425 










430 
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Cys Gly val Gly 
435 



<210> 2 
<2U> 1308 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 2 

atggccaagt acctggagct ggaggagggc ggcgtgatca tgcaggcgtt ctactgggac 60 
gtcccgagcg gaggcatctg gtgggacacc atccgccaga agatccccga gtggtacgac 120 
gccggcatct ccgcgatctg gataccgcca gcttccaagg gcatgtccgg gggctactcg 180 
atgggctacg acccgtacga ctacttcgac ctcggcgagt actaccagaa gggcacggtg 240 
gagacgcgct tcgggtccaa gcaggagctc atcaacatga tcaacacggc gcacgcctac 300 
ggcatcaagg tcatcgcgga catcgtgatc aaccacaggg ccggcggcga cctggagtgg 360 
aacccgttcg tcggcgacta cacctggacg gacttctcca aggtcgcctc cggcaagtac 420 
accgccaact acctcgactt ccaccccaac gagctgcacg cgggcgactc cggcacgttc 480 
ggcggctacc cggacatctg ccacgacaag tcctgggacc agtactggct ctgggcctcg 540 
caggagtcct acgcggccta cctgcgctcc atcggcatcg acgcgtggcg cttcgactac 600 
gtcaagggct acggggcctg ggtggtcaag gactggctca actggtgggg cggctgggcg 660 
gtgggcgagt actgggacac caacgtcgac gcgctgctca actgggccta ctcctccggc 720 
gccaaggtgt tcgacttccc cctgtactac aagatggacg cggccttcga caacaagaac 780 
atcccggcgc tcgtcgaggc cctgaagaac ggcggcacgg tggtctcccg cgacccgttc 840 
aaggccgtga ccttcgtcgc caaccacgac acggacatca tctggaacaa gtacccggcg 900 
tacgccttca tcctcaccta cgagggccag cccacgatct tctaccgcga ctacgaggag 960 
tggctgaaca aggacaagct caagaacctg atctggattc acgacaacct cgcgggcggc 1020 
tccactagta tcgtgtacta cgactccgac gagatgatct tcgtccgcaa cggctacggc 1080 
tccaagcccg gcctgatcac gtacatcaac ctgggctcct ccaaggtggg ccgctgggtg 1140 
tacgtcccga agttcgccgg cgcgtgcatc cacgagtaca ccggcaacct cggcggctgg 1200 
gtggacaagt acgtgtactc ctccggctgg gtctacctgg aggccccggc ctacgacccc 1260 
gccaacggcc agtacggcta ctccgtgtgg tcctactgcg gcgtcggc 1308 

<210> 3 
<211> 800 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 3 



Met 


Gly 


His 


Trp 


Tyr 


Lys 


His 


Gin 


Arg 


Ala Tyr 


Gin 


Phe 


Thr 


Gly 


Glu 


1 






5 










10 








15 




Asp 


Asp 


Phe 


Gly 


Lys 


Val 


Ala 


Val 


Val 


Lys Leu 


Pro 


Met 


Asp 


Leu 


Thr 




20 










25 








30 






Lys 


Val 


Gly 


lie 


He 


val 


Arg 


Leu 


Asn 


Glu Trp 


Gin 


Ala 


Lys 


Asp 


Val 




35 










40 








45 








Ala 


Lys 
50 


Asp 


Arg 


Phe 


He 


Glu 
55 


lie 


Lys 


Asp Gly 


Lys 
60 


Ala 


Glu 


Val 


Trp 


lie 


Leu 


Gin 


Gly 


Val 


Glu 


Glu 


He 


Phe 


Tyr Glu 


Lys 


Pro 


Asp 


Thr 


Ser 
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65 

Pro Arg lie Phe 

Phe Leu Thr Asn 

100 

Thr Val Asp Gly 
115 

Pro Thr Asp lie 
130 

Ser Leu Lys Glu 
145 

Gly Tyr Lys Pro 

Tyr Tyr Asp Gly 

180 

Phe Arg Val Trp 
195 

Lys Asn Gly Glu 
210 

Lys Gly Asn Gly 
225 

val Phe Tyr Leu 

Val Asp Pro Tyr 

260 

Val Val Asn Leu 
275 

Gly Pro Lys He 
290 

He Ala Asp He 

305 

Leu Tyr Leu Gly 

Thr Thr Gly Leu 

340 

lie Leu Pro Phe 
355 

Phe Glu Lys Tyr 
370 

Pro Glu Gly Arg 
385 

Arg Glu Val Lys 

Val lie Met Asp 

420 

Ser Ala Phe Asp 
435 

Thr Gly Ala Tyr 
450 

Glu Arg Pro Met 
465 

Val Lys Glu Tyr 
He Asp Lys Lys 





70 






pne 


Ala 


Gin 


Ala 


85 








Pro 


va l 


Asp 


Tnr 


Lys 


Glu 


He 


Pro 








120 


Asp 


Val 


Thr 


Asn 






135 




Glu 


Asp 


Leu 


Arg 




150 






Ala 


Arg 


Val 


He 


165 








Glu 


Leu 


Gly 


Ala 


Ser 


Pro 


Val 


Ser 








200 


Asp 


Thr 


Glu 


Pro 






215 




Val 


Trp 


Glu 


Ala 




230 






Tyr 


Gin 


• 

Leu 


Glu 


245 








Ser 


Lys 


Ala 


Val 


Ala 


Arg 


Thr 


Asn 








280 


Glu 


Gly 


Tyr 


Glu 






295 




Thr 


Gly 


Leu 


Glu 




310 






Leu 


Thr 


Glu 


Glu 


325 








Ser 


His 


Leu 


Val 


Phe 


Asp 


Phe 


Tyr 








360 


Tyr 


Asn 


Trp 


Gly 






375 




Tyr 


Ser 


Thr 


Asp 




390 






Glu 


Met 


va l 


Lys 


405 








Met 


va i 


pne 


T~*J V* i"N 

pro 


Gin 


Thr 


Val 


Pro 








440 


Leu 


Asn 


Glu 


Ser 






455 




Met 


Arg 


Lys 


Phe 




470 






His 


lie 


Asp 


Gly 


485 








Thr 


Met 


Leu 


Glu 







75 




Arg 


Ser 


fV £9 T"\ 

Asn 


uys 




90 






Lys 


Lys 


Lys 


Glu 


105 








Val 


Ser 


Arg 


va l 


Tyr 


Val 


Arg 


He 








140 


Lys 


Asp 


17-1 

Val 


Glu 






155 




Met 


Met 


Glu 


He 




170 






va l 


Tyr 


Ser 


Pro 


185 








Lys 


Trp 


Val 


Lys 


Tyr 


Gin 


Val 


Val 








220 


Val 


Val 


Glu 


Gly 






235 




Asn 


Tyr 


Gly 


Lys 




250 






Tyr 


Ala 


Asn 


Asn 


265 








Pro 


Glu 


Gly 


Trp 


Asp 


Ala 


He 


He 








300 


Asn 


Ser 


Gly 


Val 






315 




Asn 


Thr 


Lys 


Gly 




330 






Glu 


Leu 


Gly 


Val 


345 








Thr 


Gly 


ASp 


GlU 


Tyr 


Asp 


Pro 


Tyr 








380 


Pro 


Lys 


Asn 


Pro 






395 




Ala 


Leu 


His 


T • r mm 

Lys 




410 






HIS 


Tnr 


Tyr 


oiy 


42 5 








Tyr 


Tyr 


pne 


Tyr 


Gly 


Cys 


Gly 


Asn 








460 


He 


Val 


Asp 


Thr 






475 




Phe 


Arg 


Phe 


Asp 




490 






val 


Glu 


Arg 


Ala 



80 

Val He Glu Ala 
95 

Leu Phe Lys Val 
110 

Glu Lys Ala Asp 
125 

Val Leu Ser Glu 

Leu He He Glu 

160 

Leu Asp Asp Tyr 
175 

Glu Lys Thr He 
190 

Val Leu Leu Phe 
205 

Asn Met Glu Tyr 

Asp Leu Asp Gly 

240 

He Arg Thr Thr 
255 

Gin Glu Ser Ala 
270 

Glu Asn Asp Arg 
285 

Tyr Glu He His 

Lys Asn Lys Gly 

320 

Pro Gly Gly Val 
335 

Thr His Val His 
350 

Leu Asp Lys Asp 
365 

Leu Phe Met Val 

His Thr Arg He 

400 

His Gly He Gly 
415 

He Gly Glu Leu 
430 

Arg He Asp Lys 
445 

val He Ala Ser 

Val Thr Tyr Trp 

480 

Gin Met Gly Leu 
495 

Leu His Lys He 
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500 










505 








Pro 


Thr 


lie 


lie 


Leu 


Tvr 
y 


Gly Glu 


Pro 


Trp 






515 










520 








Pro 


lie 


Arg 


Phe Gly 


ujr 9 


Spr 
o ^ i- 


Asp 


Val 


Ala 


Gly 














nt: 
j j j 










Phe 

r lie 


Asn 


Asp Glu 


Phe 


Ara 


Asd 


Ala 


lie 


Arg Gly 












^ ^ 0 










555 


Cay- 

OCA. 


Val 

V CI X 


Lys 


Gly 


Phe 


Val 


Met 


Gly Gly 


Tyr Gly 










56 5 

•mf \J <J 










R70 




Ity S 


Arg 


Gly Val 


Val 


\j±y 


Car 
Oct 


lie 


/AO 1 1 


Tvr 










con 

jOvi 










c; ft r 

JO J 






Ser 


Phe 


Al a 


Leu 


ASD 


rXU 




fjlii 
\JXU 


Th r 

XIII. 


Tie 

J. J- c 


noil 






CQC 


















Asp 


Asn 


nib 


Thr 


Le 1} 


i rp 


Mo LJ 


T iV q 


A s n 

noil 


Tv r 


Leu 




610 










Ol3 










Lys 


Lys 


T \ / c 
Lyb 


Glu 




Thr 


m n 

vj j. u 


m ii 

Ui U 




T ,pi i 


T,VC 


625 










o J U 












Ala 


Gly 


hla 


He 


Leu 


T oil 
Leu 


l n i 


Ser 


Gin 


wiy 


V Q X 










645 










D 3U 




Gly Gin 


Asp 


Phe 


Cys 


Mrg 


Th t- 


Thr 


Asn 




A c n 
Ms 1 1 








660 










665 






Ala 


Pro 


lie 


Ser 


He 


Ann 

Mall 


vj i y 


Phe 


Asp 


Tyr 


Glu 






675 










680 








lie 


Asp 


Val 


Phe 


Asn 


Tyr 


His 


Lys 


Gly 


Leu 


He 




690 










695 










His 


Pro 


Ala 


Phe 


Arg 


Leu 


Lys 


Asn 


Ala 


Glu 


Glu 


705 










710 










715 


Glu 


Phe 


Leu 


Pro 


Gly 


Gly 


Arg 


Arg 


lie 


Val 


Ala 










725 










730 




His 


Ala 


Gly 


Gly Asp 


Pro 


Trp 


Lys 


Asp 


lie 


Val 








740 










745 






Asn 


Leu 


Glu 


Lys 


Thr 


Thr Tyr 


Lys 


Leu 


Pro 


Glu 






755 










760 








Val 


Val 


Asn 


Ser 


Gin 


Lys 


Ala 


Gly Thr 


Glu 


Val 




770 










775 










Gly 


Thr 


lie 


Glu 


Leu 


Asp 


Pro 


Leu 


Ser 


Ala 


Tyr 


785 










790 










795 



<210> 4 
<211> 2400 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 4 

atgggccact ggtacaagca ccagcgcgcc taccagttca 
aaggtggccg tggtgaagct cccgatggac ctcaccaagg 
aacgagtggc aggegaagga cgtggccaag gaccgcttca 
gccgaggtgt ggatactcca gggcgtggag gagatcttct 
ccgcgcatct tcttcgccca ggcccgctcc aacaaggtga 
ccggtggaca ccaagaagaa ggagctgttc aaggtgaccg 

171 



10/591870 

IAP1 2 Rec'd PCT/PTO 07SEP 2006 

PCTYUS2004/007182 

510 



Glv 


Gly 


Trp 


Gly 


Ala 




525 








Thr 


His 


Val 


Ala 


Ala 


540 










Ser 


Val 


Phe 


Asn 


Pro 










560 

^ w w 


Lvs 


Glu 


Thr 


LV9 


He 

X X 








575 




Glv 


Lvs 


Leu 

w v* 


lie 


Lvs 

X*Jr w 






590 






Tvr 


Ala 


Ala 


Pvs 


His 

It X o 




605 








Ala 

n x c* 


Ala 


Lvs 


Ala 

r» X d 


Asn 


620 










Asn 


Ala 


Gin 

VJ 111- 


Lvs 


i-J v^- U 










640 


Drft 


Phe 

rue 


T.pii 

LJ \Z U 


Hi s 


Gl v 
vj x y 








655 

w ™* w 




Aen 
MO kJ 


Ann 
no 1 1 


Car 
O C L 


iyr 


A c n 

MO 11 






670 






A ra 

mi. y 


T.vq 

1-1 y O 


TiPi i 


m n 


Phe 




685 

V %r -v^ 








T ,\/ a 
Liy o 


T.en 
i_»c u 


Am 
Mx y 




Ui u 


700 










He 

X X w 


Lvs 


Lvs 


His 

It X o 


Leu 










720 


Phe 


Met 


Leu 


Lys 


Asp 








735 




Val 


lie 


Tyr 


Asn 


Gly 






750 






Gly 


Lys 


Trp 


Asn 


val 




765 








He 


Glu 


Thr 


val 


Glu 


780 










Val 


Leu 


Tyr 


Arg 


Glu 



800 



ccggcgagga egactteggg 60 
tgggcatcat cgtgcgcctc 120 
tcgagatcaa ggacggcaag 180 
aegagaagee ggacacctcc 240 
tegaggcett cctcaccaac 300 
tegaeggcaa ggagatcccg 360 
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gtgtcccgcg tggagaaggc cgacccgacc gacatcgacg tgaccaacta cgtgcgcatc 420 
gtgctctccg agtccctcaa ggaggaggac ctccgcaagg acgtggagct gatcatcgag 480 
ggctacaagc cggcccgcgt gatcatgatg gagatcctcg acgactacta ctacgacggc 540 
gagctggggg cggtgtactc cccggagaag accatcttcc gcgtgtggtc cccggtgtcc 600 
aagtgggtga aggtgctcct cttcaagaac ggcgaggaca ccgagccgta ccaggtggtg 660 
aacatggagt acaagggcaa cggcgtgtgg gaggccgtgg tggagggcga cctcgacggc 720 
gtgttctacc tctaccagct ggagaactac ggcaagatcc gcaccaccgt ggacccgtac 780 
tccaaggccg tgtacgccaa caaccaggag tctgcagtgg tgaacctcgc ccgcaccaac 840 
ccggagggct gggagaacga ccgcggcccg aagatcgagg gctacgagga cgccatcatc 900 
tacgagatcc acatcgccga catcaccggc ctggagaact ccggcgtgaa gaacaagggc 960 
ctctacctcg gcctcaccga ggagaacacc aaggccccgg gcggcgtgac caccggcctc 1020 
tcccacctcg tggagctggg cgtgacccac gtgcacatcc tcccgttctt cgacttctac 1080 
accggcgacg agctggacaa ggacttcgag aagtactaca actggggcta cgacccgtac 1140 
ctcttcatgg tgccggaggg ccgctactcc accgacccga agaacccgca cacccgaatt 1200 
cgcgaggtga aggagatggt gaaggccctc cacaagcacg gcatcggcgt gatcatggac 1260 
atggtgttcc cgcacaccta cggcatcggc gagctgtccg ccttcgacca gaccgtgccg 1320 
tactacttct accgcatcga caagaccggc gcctacctca acgagtccgg ctgcggcaac 1380 
gtgatcgcct ccgagcgccc gatgatgcgc aagttcatcg tggacaccgt gacctactgg 1440 
gtgaaggagt accacatcga cggcttccgc ttcgaccaga tgggcctcat cgacaagaag 1500 
accatgctgg aggtggagcg cgccctccac aagatcgacc cgaccatcat cctctacggc 1560 
gagccgcggg gcggctgggg ggccccgatc cgcttcggca agtccgacgt ggccggcacc 1620 
cacgtggccg ccttcaacga cgagttccgc gacgccatcc gcggctccgt gttcaacccg 1680 
tccgtgaagg gcttcgtgat gggcggctac ggcaaggaga ccaagatcaa gcgcggcgtg 1740 
gcgggcccca tcaactacga cggcaagctc atcaagtcct tcgccctcga cccggaggag 1800 
accatcaact acgccgcctg ccacgacaac cacaccctct gggacaagaa ctacctcgcc 1860 
gccaaggccg acaagaagaa ggagtggacc gaggaggagc tgaagaacgc ccagaagctc 192 0 
gccggcgcca tcctcctcac tagtcagggc gtgccgttcc tccacggcgg ccaggacttc 1980 
tgccgcacca ccaacttcaa cgacaactcc tacaacgccc cgatctccat caacggcttc 2040 
gactacgagc gcaagctcca gttcatcgac gtgttcaact accacaaggg cctcatcaag 2100 
ctccgcaagg agcacccggc cttccgcctc aagaacgccg aggagatcaa gaagcacctg 2160 
gagttcctcc cgggcgggcg ccgcatcgtg gccttcatgc tcaaggacca cgccggcggc 2220 
gacccgtgga aggacatcgt ggtgatctac aacggcaacc tggagaagac cacctacaag 2280 
ctcccggagg gcaagtggaa cgtggtggtg aactcccaga aggccggcac cgaggtgatc 2340 
gagaccgtgg agggcaccat cgagctggac ccgctctccg cctacgtgct ctaccgcgag 2400 



<210> 5 

<2ll> 693 

<212> PRT 

<213> Sulfolobus solfataricus 

<400> 5 



Met Glu 


Thr 


lie 


Lys 


He 


Tyr 


Glu 


Asn 


Lys 


Gly 


Val 


Tyr 


Lys 


Val 


Val 


1 






5 










10 










15 




lie Gly 


Glu 


Pro 


Phe 


Pro 


Pro 


lie 


Glu 


Phe 


Pro 


Leu 


Glu 


Gin 


Lys 


He 




20 










25 










30 






Ser Ser 


Asn 


Lys 


Ser 


Leu 


Ser 


Glu 


Leu 


Gly 


Leu 


Thr 


He 


Val 


Gin 


Gin 




35 










40 










45 








Gly Asn 


Lys 


Val 


He 


Val 


Glu 


Lys 


Ser 


Leu 


Asp 


Leu 


Lys 


Glu 


His 


He 


50 










55 










60 










He Gly 


Leu 


Gly 


Glu 


Lys 


Ala 


Phe 


Glu 


Leu 


Asp 


Arg 


Lys 


Arg 


Lys 


Arg 


65 








70 










75 










80 


Tyr val 


Met 


Tyr 


Asn 


Val 


Asp 


Ala 


Gly 


Ala 


Tyr 


Lys 


Lys 


Tyr 


Gin 


Asp 



85 90 95 
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Pro 


Leu 


Tyr 


Val 








100 


h 1 a 


T" Vi v 




iyr 






lib 




p T « r 

uiy 


ueu 


LjIU 


pin 




1 ^ A 






V 3 1 


pin 
U1U 


r ne 


Tyr 


14 5 










Tyr 


1 Hi 


ulu 


Phe 


Gly 


Tyr 


Met 








180 


va J. 


P T i i 

blU 


Leu 


va i 






195 




val 


p> i_ 

Pne 


Leu 


Asp 




210 






It i n 

HIS 


HtO 


lyr 


Arg 


"s o r* 

2 25 








Lys 


Arg 


As n 


va i 


Asp 


Gin 


Asn 


Tyr 








260 


lie 


GlU 


Ser 


Gly 






275 




Val 


Tyr 


Pro 


Asp 




2 90 






Leu 


lie 


O a 

oc r 


pin 


3 05 










A £• 

as n 


\j 1 u 




Leu 


Ser 


Ser 


Leu 








340 


Pne 


Pro 


Asp 


Asn 






^ c c 

3 5 5 




Glu 


Lys 


va i 


Arg 




370 






Lys 


uiy 


n Vi a 
rne 


Arg 


*i n c 

3 85 








Ala 


u iy 


Tyr 


a i a 


Asn 


Thr 


Pro 


Ser 








420 


<j i y 


bcU 


JCI 


Tip 

IXC 






/lie 
4 ,} D 




r ne 


uin 


fl 1 w 

uiy 


hi g 




450 






Val 


Lys 


Tyr 


Tyr 


465 








Lys 


Ala 


Thr 


Asp 


Tyr 


Lys 


GlU 


Lys 








500 


Pro 


Tyr 


He 


Tyr 



515 



Ser He Pro Leu 

Phe Phe Asn Ser 

120 

Tyr Asp Lys Val 
135 

Val He Glu Gly 
150 

Leu Thr Gly Lys 
165 

lie Ser Arg Tyr 

Asp He Met Gin 

200 

He His Tyr Met 
215 

Phe Pro Glu Pro 
230 

Lys Leu He Thr 
245 

Ser Pro Phe Leu 

Glu Leu Phe Val 

280 

Phe Phe Arg Glu 
295 

Trp Leu Ser Gin 
310 

Thr Asp Phe Ser 
325 

Pro Val Gin Phe 

Val Val His Tyr 

360 

Asn Ala Tyr Pro 
375 

Thr Ser His Arg 
390 

Gly He Gin Arg 
405 

Trp Asp Asp Leu 
Ser Gly Val Pro 

440 

Asn Phe Ala Glu 
455 

Ala Leu Ala Leu 
470 

Gly He Asp Thr 
485 

Val Lys Glu He 

Ser Leu Ala Leu 

520 



rue 


Tip 

11C 




Val 


l yj D 








Ala 


Ser 


LVS 


Val 


lie 


Val 


Thr 


lie 








140 


~ i. \j 


nl y 


He 
1 1 w 


Glu 






13 3 




P >-p 


Php 

rue 


TiP 11 


Pro 




i / u 






Q o y 


iyr 


iyr 


Pro 


1 □ C 
1 O 0 








T a/ e 


Vj 1 u 


o i y 


Php 


Asp 


Ser 


Tyr 


Lys 








220 


T .\/c 


jjy o 


T ,P 1 1 

Lj vT U 


t i e 

iic 






O c> 




TIP 


Va 1 
»oi 


Asn 

no w 


His 
1113 




z D U 






OCI 


ijiy 




Glv 
vji y 










Glv 
uiy 


J-ijr O 


Met 


Tro 


Asp 


Thr 


Arg 


Glu 








300 


pi \; 

vjiy 


v a l 


Asp 


Gly 






315 




Arg 


a jl a 


lie 


Glu 




^ ^ a 






A y-p 


A en 


Asp 


Arg 










T .pi i 


t\x y 


Gly Lys 


Leu 


Tyr 


Glu 


Ala 








380 


noil 


m ii 


He 


Phe 






395 




Tyr 


a 1 a 
Aid 


Phe 


lie 




A 1 f\ 

410 






uys 


T on 


Gin 


Leu 










Php 

rile 


Va 1 
v a i 


Gly Cys 


I le 


Asp 


Asn 


Ser 








460 


Phe 


Phe 


Pro 


Phe 






475 




Glu 


Pro 


Val 


Phe 




490 






Val 


Glu 


Leu 


Arg 


505 








Glu 


Ala 


Ser 


Glu 



173 



LiVS 




Glv 
oiy 


Val 

v a x 




1 1 V 






lie 


Phe 


ASD 


Val 


1 j 








Pro 


Glu 

vl U 


Asn 


Ser 


Asp 


Val 


Leu 


Glu 








160 


Pro 

It 1 w 


rlt: L 


i irp 


Ala 

PI CI 






I'D 




Vj 1 II 




jjy 9 


Va 1 
v d 1 




i a a 

190 






a ry 


\/a 1 
V a 1 


Aid 


P]w 

vj i y 


205 








Leu 


rne 


i nr 


irp 


Asp 


Glu 


Leu 


His 








240 


sji y 


Tip 
lie 


A yp 

Ai y 


Va 1 
v d i 






OCC 




L*y o 


Php 
rile 


P\/ Q 


Gl vi 

ul U 




OTA 

2 /u 






D V" /-\ 

riO 




Th r- 

i nr 


Th r- 

i nr 


60 J 








T 1 -y r*\ 

irp 


irp 


TV 1 a 
Aid 


Glv 
\j i y 


He 


Trp 


Leu 


Asp 








320 


lie 


ax y 


A Q p 

Aap 


Va 1 






j J j 




T .01 1 


Va 1 

V Ct 1 


Th t* 

1 411 


Th t* 

A 1 1 1 










Arg 


vai 


i-iys 


U -i Q 

n 1 9 


*5 £ c 








Mp f 


Ala 


Thr 


Phe 


He 


Leu 


Ser 


Arg 








400 


irp 


Thr 


ui y 


Aap 






4 1 D 




m n 




Va 1 

VOl 


T iPi i 




din 






A c o 


He 

1 1 w 


Glv 
uiy 


Glv 


d A c: 
IIS 








Mpt 


Ion 

no ^ 


Lou 


Li P. 11 


Tyr 


Arg 


Ser 


His 








480 


Leu 


Pro 


Asp 


Tyr 






495 




Tyr 


Lys 


Phe 


Leu 




510 






Lys 


Gly 


His 


Pro 
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Val 


He 




D YT\ 

r I U 


Leu 


Phe 


Tyr 


niii 

Ulu 




530 










535 




Arg 


He 




Ren 


Glu 


Tyr 


Met 




545 










550 






lie 


Val 


opr 
oct 


u y o 


Glu 


Glu 


Ser 


A fa 










565 








Trp Tyr 


Zi c n 
Mb II 


iyr 


Trp Asn Gly 


w i U 








C Q ^ 










Lys 


Ser 


l 11 1 


ills 


Glu 


Leu 


Pro 


Tie 






CQC 

J 7 J 










D v v 


Pro 


Leu 


Vj 1 U 


vj iy 


Asp 


Glu 


Leu 


Tip 




610 










615 




Arg 


Tyr 


A O 

rtSp 


noil 


Ala 


Glu 


He 


TVi y~ 


625 










630 






Ser 


Arg 


Glu 


lie 


Tyr 


Val 


Ser 


Lys 










645 








Val 


Ser 


Lys 


lie 


He 


Val 


Asp 


Asp 








660 










Thr 


Met 


Gin 


Asn 


Thr 


Tyr 


Val 


Ala 






675 










680 


Lys 


He 


Asn 


Leu 


Glu 









690 



flic 


Gin 

v ill 


Asd 


ASD 


ASD 


Asd 


Met 


Tvr 








540 










Gl v 




Tvr 


Leu 


Leu 


Tvr 


Ala 


Pro 






555 

j j j 










560 

J v> V 


bell 


Val 


Thr 

1 J 1 L 




Pro 


Ara 


Glv 
uiy 


u y o 




CTrt 
O r V 














X J. c 


Tip 
lie 


Sen 


ni v 

vj i y 


T ,VQ 
u y o 


^e r 


Va 1 

V CI X 


Val 

vox 


CQC 

JO J 










5 90 






iyr 


uC u 




Glu 


Glv 


Se r 


He 


He 










fine. 








v a J. 


iyr 


Gl V 


Glu 


Thr 




Phe 


Live; 

u y w 








620 










Ser 


Ser 


Ser 


Asn 


Glu 


He 


Lys 


Phe 






635 










640 


Leu 


Thr 


He 


Thr 


Ser 


Glu 


Lys 


Pro 




650 










655 




Ser 


Lys 


Glu 


He 


Gin 


Val 


Glu 


Lys 


665 










670 






Lys 


He 


Asn 


Gin 


Lys 


He 


Arg 


Gly 



685 



<210> 6 
<211> 2082 
<212> DNA 

<213> Sulfolobus solfataricus 

<400> 6 

atggagacca tcaagatcta cgagaacaag ggcgtgtaca aggtggtgat cggcgagccg 60 
ttcccgccga tcgagttccc gctcgagcag aagatctcct ccaacaagtc cctctccgag 120 
ctgggcctca ccatcgtgca gcagggcaac aaggtgatcg tggagaagtc cctcgacctc 180 
aaggagcaca tcatcggcct eggegagaag gccttcgagc tggaccgcaa gcgcaagcgc 240 
tacgtgatgt acaacgtgga cgccggcgcc tacaagaagt accaggaccc getctaegtg 300 
tccatcccgc tcttcatctc cgtgaaggac ggcgtggcca ccggctactt cttcaactcc 360 
gcctccaagg tgatcttcga cgtgggcctc gaggagtacg acaaggtgat cgtgaccatc 420 
ceggaggact ccgtggagtt etaegtgate gagggcccgc gcatcgagga cgtgctcgag 480 
aagtacaccg agetgacegg caagccgttc ctcccgccga tgtgggcctt eggctacatg 540 
atctcccgct actcctacta cccgcaggac aaggtggtgg agctggtgga catcatgeag 600 
aaggagggct tccgcgtggc cggcgtgttc ctcgacatcc actacatgga ctcctacaag 660 
ctcttcacct ggcacccgta ccgcttcccg gagecgaaga agctcatcga cgagctgcac 720 
aagcgcaacg tgaagctcat caccatcgtg gaccacggca tccgcgtgga ccagaactac 780 
tccccgttcc tctccggcat gggcaagttc tgegagateg agtceggega gctgttcgtg 840 
ggcaagatgt ggccgggcac caccgtgtac ccggacttct tecgegagga cacccgcgag 900 
tggtgggccg gcctcatctc cgagtggctc tcccagggcg tggaeggcat ctggctcgac 960 
atgaacgagc cgaccgactt ctcccgcgcc atcgagatcc gcgacgtgct ctcctccctc 1020 
ccggtgcagt tccgcgacga ccgcctcgtg accaccttcc eggacaaegt ggtgcactac 1080 
ctccgcggca agcgcgtgaa gcacgagaag gtgcgcaacg cctacccgct etacgaggeg 1140 
atggccacct tcaagggctt ccgcacctcc caccgcaacg agatcttcat cctctcccgc 1200 
gccggctacg ccggcatcca gcgctacgcc ttcatctgga ccggcgacaa caccccgtcc 1260 
tgggacgacc tcaagctcca gctccagctc gtgctcggcc tctccatctc cggcgtgccg 1320 
ttcgtgggct gegacategg cggcttccag ggccgcaact tegecgagat cgacaactcg 1380 
atggacctcc tcgtgaagta ctacgccctc gccctcttct tcccgttcta ccgctcccac 1440 
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aaggccaccg acggcatcga caccgagccg gtgttcctcc cggactacta -caaggagaag 1500 

gtgaaggaga tcgtggagct gcgctacaag ttcctcccgt acatctactc cctcgccctc 1560 

gaggcctccg agaagggcca cccggtgatc cgcccgctct tctacgagtt ccaggacgac 1620 

gacgacatgt accgcatcga ggacgagtac atggtgggca agtacctcct ctacgccccg 1680 

atcgtgtcca aggaggagtc ccgcctcgtg accctcccgc gcggcaagtg gtacaactac 1740 

tggaacggcg agatcatcaa cggcaagtcc gtggtgaagt ccacccacga gctgccgatc 1800 

tacctccgcg agggctccat catcccgctc gagggcgacg agctgatcgt gtacggcgag 1860 

acctccttca agcgctacga caacgccgag atcacctcct cctccaacga gatcaagttc 1920 

tcccgcgaga tctacgtgtc caagctcacc atcacctccg agaagccggt gtccaagatc 1980 

atcgtggacg actccaagga gatccaggtg gagaagacca tgcagaacac ctacgtggcc 2040 

aagatcaacc agaagatccg cggcaagatc aacctcgagt ga 2082 

<210> 7 
<21b 1818 
<2\2> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 7 

atggcggctc tggccacgtc gcagctcgtc gcaacgcgcg ccggcctggg cgtcccggac 60 
gcgtccacgt tccgccgcgg cgccgcgcag ggcctgaggg gggcccgggc gtcggcggcg 120 
gcggacacgc tcagcatgcg gaccagcgcg cgcgcggcgc ccaggcacca gcaccagcag 180 
gcgcgccgcg gggccaggtt cccgtcgctc gtcgtgtgcg ccagcgccgg catgaacgtc 240 
gtcttcgtcg gcgccgagat ggcgccgtgg agcaagaccg gaggcctcgg cgacgtcctc 300 
ggcggcctgc cgccggccat ggccgcgaac gggcaccgtg tcatggtcgt ctctccccgc 360 
tacgaccagt acaaggacgc ctgggacacc agcgtcgtgt ccgagatcaa gatgggagac 420 
gggtacgaga cggtcaggtt cttccactgc tacaagcgcg gagtggaccg cgtgttcgtt 480 
gaccacccac tgttcctgga gagggtttgg ggaaagaccg aggagaagat ctacgggcct 540 
gtcgctggaa cggactacag ggacaaccag ctgcggttca gcctgctatg ccaggcagca 600 
cttgaagctc caaggatcct gagcctcaac aacaacccat acttctccgg accatacggg 660 
gaggacgtcg tgttcgtctg caacgactgg cacaccggcc ctctctcgtg ctacctcaag 720 
agcaactacc agtcccacgg catctacagg gacgcaaaga ccgctttctg catccacaac 780 
atctcctacc agggccggtt cgccttctcc gactacccgg agctgaacct ccccgagaga 840 
ttcaagtcgt ccttcgattt catcgacggc tacgagaagc ccgtggaagg ccggaagatc 900 
aactggatga aggccgggat cctcgaggcc gacagggtcc tcaccgtcag cccctactac 960 
gccgaggagc tcatctccgg catcgccagg ggctgcgagc tcgacaacat catgcgcctc 1020 
accggcatca ccggcatcgt caacggcatg gacgtcagcg agtgggaccc cagcagggac 1080 
aagtacatcg ccgtgaagta cgacgtgtcg acggccgtgg aggccaaggc gctgaacaag 1140 
gaggcgctgc aggcggaggt cgggctcccg gtggaccgga acatcccgct ggtggcgttc 1200 
atcggcaggc tggaagagca gaagggcccc gacgtcatgg cggccgccat cccgcagctc 1260 
atggagatgg tggaggacgt gcagatcgtt ctgctgggca cgggcaagaa gaagttcgag 1320 
cgcatgctca tgagcgccga ggagaagttc ccaggcaagg tgcgcgccgt ggtcaagttc 1380 
aacgcggcgc tggcgcacca catcatggcc ggcgccgacg tgctcgccgt caccagccgc 1440 
ttcgagccct gcggcctcat ccagctgcag gggatgcgat acggaacgcc ctgcgcctgc 1500 
gcgtccaccg gtggactcgt cgacaccatc atcgaaggca agaccgggtt ccacatgggc 1560 
cgcctcagcg tcgactgcaa cgtcgtggag ccggcggacg tcaagaaggt ggccaccacc 1620 
ttgcagcgcg ccatcaaggt ggtcggcacg ccggcgtacg aggagatggt gaggaactgc 1680 
atgatccagg atctctcctg gaagggccct gccaagaact gggagaacgt gctgctcagc 1740 
ctcggggtcg ccggcggcga gccaggggtt gaaggcgagg agatcgcgcc gctcgccaag 1800 
gagaacgtgg ccgcgccc 1818 

<210> 8 
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<211> 606 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 8 



Met 


Ala 


Aia 


Leu 


ai a 


inr 


O A "T" 

OCI 


oin 




va i 


Ala 


Thr 


Arg 


Ala 


Gly Leu 


1 








5 










10 










15 




Gly Val 


>-v 

Pro 


Asp 


Ala 


ser 


i nr 


Pne 


Arg 


Arg 


Gly Ala Ala 


Gin 


Gly 


Leu 








20 










25 










30 






Arg Gly 


Ala 


Arg 


Ala 


Ser 


Ala 


Ala 


Ala 


Asp 


Thr 


Leu 


Ser 


Met 


Arg 


Thr 






35 










4 0 










45 








Ser 


Ala 


Arg 


Ala 


Ala 


Pro 


Arg 


His 


Gin 


His 


Gin 


Gin 


Ala 


Arg Arg Gly 




50 










55 










60 










Ala 


Arg 


Phe 


Pro 


Ser 


Leu 


val 


Val 


Cys 


Ala 


Ser 


Ala 


Gly 


Met 


Asn 


Val 


65 










70 










75 










80 


Val 


Phe 


Val 


Gly 


Al a 


Glu 


Met 


Ala 


Pro 


Trp 


Ser 


Lys 


Thr Gly Gly 


Leu 










85 










90 










95 




Gly 


Asp 


Val 


Leu 


Gly 


Gly 


Leu 


Pro 


Pro 


Ala 


Met 


Ala Ala Asn Gly His 








100 










105 










110 






Arg 


Val 


Met 
115 


Val 


val 


Ser 


Pro 


Arg 
120 


Tyr 


Asp 


Gin 


Tyr 


Lys 
125 


Asp 


Ala 


Trp 


Asp 


Thr 


Ser 


llo 1 

Va 1 


Val 


Ser 


Glu 


lie 


Lys 


Met 


Gly 


Asp Gly Tyr Glu Thr 




13 0 










135 










140 










Val 


Arg 


Phe 


Phe 


His 


Cys 


Tyr 


Lys 


Arg 


Gly 


Val 


Asp 


Arg 


Val 


Phe 


Val 


14 5 










150 










155 










160 


m ____ 

ASp 


His 


Pro 


X **** i i 

Leu 


pne 
16 5 


Leu 


Glu 


Arg 


Val 


Trp 
170 


Gly 


Lys 


Thr 


Glu 


Glu 
175 


Lys 


T 1 ^ 

lie 


Ty r 


Gly 


pro 


va i 


Aia 


Gly Thr Asp Tyr 


Arg 


Asp 


Asn 


Gin 


Leu 


Arg 








180 










185 










190 






Phe 


Ser 


Leu 


Leu 


v_y s 


oin 


Ala 


Ala 


Leu 


Glu 


Ala 


Pro Arg 


He 


Leu 


Ser 






195 










200 










205 








Leu 


Asn 


as n 


Asn 


pro 


Tyr 


Phe 


Ser 


Gly 


Pro 


Tyr Gly Glu Asp 


Val 


Val 




210 










215 










220 










Phe 


Val 


Cys 


Asn 


Asp 


Trp 


His 


Thr Gly 


Pro 


Leu 


Ser 


Cys 


Tyr 


Leu 


Lys 


225 










230 










235 










240 


Ser 


Asn 


Tyr 


Gin 


Ser 
245 


His 


Gly 


He 


Tyr 


Arg 
250 


Asp 


Ala 


Lys 


Thr 


Ala 
255 


Phe 


Cys 


lie 


His 


Asn 


He 


Ser 


Tyr Gin Gly Arg 


Phe 


Ala 


Phe 


Ser 


Asp 


Tyr 








260 










265 










270 






Pro 


Glu 


Leu 
275 


Asn 


Leu 


Pro 


Glu 


Arg 
280 


Phe 


Lys 


Ser 


Ser 


Phe 
285 


Asp 


Phe 


He 


Asp 


Gly 


Tyr 


Glu 


Lys 


Pro 


Val 


Glu 


Gly Arg 


Lys 


lie 


Asn 


Trp 


Met 


Lys 




290 










295 










300 










Ala 


Gly 


lie 


Leu 


Glu 


Ala 


Asp 


Arg 


Val 


Leu 


Thr 


Val 


Ser 


Pro 


Tyr 


Tyr 


305 










310 










315 










320 


Ala 


Glu 


Glu 


Leu 


lie 


Ser 


Gly 


He 


Ala 


Arg 


Gly Cys 


Glu 


Leu 


Asp 


Asn 










325 










330 










335 




lie 


Met 


Arg 


Leu 


Thr 


Gly 


He 


Thr Gly 


He 


Val 


Asn Gly Met 


Asp 


Val 








340 










345 










350 






Ser 


Glu 


Trp 
355 


Asp 


Pro 


Ser 


Arg 


Asp 
360 


Lys 


Tyr 


He 


Ala 


Val 
365 


Lys 


Tyr 


Asp 
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Val 


Ser 


Thr 


Ala 


Val 


Glu 


Ala 


Lvs 




370 














Ala 


Glu 


Val 


Glv 

VJ Ay 


Leu 

m+ 


Pro 


Val 


Asd 


385 










3 90 

J Z> V 






lie 


Gly 


mi y 


T.P1 1 


Glu 


Glu 


Gin 

will 


T.VQ 

u y o 










*t V 5 








He 


Pro 


CZ 1 n 




rlc u 


Gill 


Mph 
llv? u 


Ua 1 

v a j. 








** £ U 










Gly Thr 


u i y 


T A/ c 


T ,\/ c: 
U j s> 


T iVQ 
u y s 


Phe 


Glu 






A T C 
H J D 










A A. Cl 
4 4 w 


Lys 


Phe 


Dm 


yi y 


T A/ o. 

by a 


v a i. 


nL y 


Ala 




4 50 










A e; c. 




Ala 


His 


Ui e 
r. 1 a 


Tip 
lie 


Mp t* 

1 IC l 


Ala 




Ala 


465 










^ n c\ 






Phe 


Glu 


r iu 




uiy 


T .p 1 1 


Tip 
x x c 


G 1 n 










HOD 








Pro 


Cys 


t 1 a 
rtla 


V- y «D 


A 1 A 


^PT 


Th r 


vxy 








c n n 

Jw v 










Gly 


Lys 


T* \\ y 
1 11 i 


j. y 


r lie 


Hie 


Mpf- 


G 1 v 






ri c 
DID 












Val 


Glu 


Fro 


A J. a 


ASp 


V a i 


i_»y S 


T wo 

L»y » 




530 










535 




He 


Lys 


Val 


Val 


Gly 


Thr 


Pro 


Ala 


545 










550 






Met 


He 


Gin 


Asp 


Leu 


Ser 


Trp 


Lys 










565 








val 


Leu 


Leu 


Ser 


Leu 


Gly 


Val 


Ala 








580 










Glu 


Glu 


lie 


Ala 


Pro 


Leu 


Ala 


Lys 



595 600 



Ala 


Leu 


Asn 


Lvs 


Glu 


Ala 


Leu 


Gin 








380 










♦ » i- ^3 


Asn 


He 


Pro 


Leu 


Val 


Ala 


Phe 






395 










400 


Glv 


Pro 


Asd 


Val 


Met 


Ala 


Ala 


Ala 




4 10 

I X V 










415 




Glu 


AsD 


Val 


Gin 


He 


Val 


Leu 


Leu 


4 9 5 
*t & j 










4 3 0 






irn 


Met 

II w V 


Leu 

U W v* 


Met 


Ser 


Ala 


Glu 


Glu 










44 5 

*1 *T -J 








Val 


Val 


Lvs 


Phe 


Asn 


Ala 


Ala 


Leu 








4 60 

T w V 










nop 


Va 1 


Leu 


Ala 


Val 


Thr 

i 111. 


Ser 


Ara 






4 7 5 










4 80 


U w \i4 


Gin 


Glv 

\jx y 


Met 


Ara 


Tvr 


Gly 


Thr 




4 90 

T ^ V 










4 95 




Glv 

vjxy 


Leu 

M W Vi 


Val 


Asd 


Thr 


He 

« * ^ 


He 


Glu 


J V J 










510 






Ara 


Leu 


Ser 


val 


Asp 


Cvs 


Asn 


Val 










52 5 








vai 


Ala 

Aia 


i nr 


i nr 


bell 


\J 1 11 


ml y 


A 1 A 

Mia 








540 










Tyr 


Glu 


Glu 


Met 


Val 


Arg 


Asn 


Cys 






555 










560 


Gly 


Pro 


Ala 


Lys 


Asn 


Trp 


Glu 


Asn 




570 










575 




Gly 


Gly 


Glu 


Pro 


Gly 


Val 


Glu 


Gly 


585 










590 






Glu 


Asn 


Val 


Ala 


Ala 


Pro 







605 



<210> 9 
<211> 2223 
<212> DNA 

<213> Artificial Sequence 
<220> 

< 2 2 3 > synthetic 
<400> 9 

atggccaagt acctggagct ggaggagggc ggcgtgatca tgcaggcgtt ctactgggac 60 

gtcccgagcg gaggcatctg gtgggacacc atccgccaga agatccccga gtggtacgac 120 

gccggcatct ccgcgatctg gataccgcca gcttccaagg geatgtcegg gggctactcg 180 

atgggctacg acccgtacga ctacttcgac cteggegagt actaccagaa gggcacggtg 240 

gagacgeget tegggtccaa gcaggagctc atcaacatga tcaacacggc gcacgcctac 300 

ggcatcaagg teatcgegga categtgate aaccacaggg ccggcggcga cctggagtgg 360 

aacccgttcg teggegacta cacctggacg gacttctcca aggtcgcctc eggcaagtae 420 

accgccaact acctcgactt ccaccccaac gagctgeacg cgggcgactc cggcacgttc 480 

ggcggctacc eggacatctg ccacgacaag tcctgggacc agtactggct ctgggcctcg 540 

caggagtcct acgcggccta cctgcgctcc ateggcateg acgcgtggcg cttcgactac 600 

gtcaagggct aeggggectg ggtggtcaag gactggctca actggtgggg cggctgggcg 660 

gtgggcgagt actgggacac caacgtcgac gcgctgctca actgggecta ctcctccggc 720 

gccaaggtgt tcgacttccc cctgtactac aagatggacg cggccttcga caacaagaac 780 

177 



WO 2005/096804 



PCT/US2004/007182 



atcccggcgc tcgtcgaggc cctgaagaac ggcggcacgg tggtctcccg cgacccgttc 840 
aaggccgtga ccttcgccgc caaccacgac acggacatca tctggaacaa gtacccggcg 900 
tacgccttca tcctcaccta cgagggccag cccacgatct tctaccgcga ctacgaggag 960 
tggctgaaca aggacaagct caagaacctg atctggattc acgacaacct cgcgggcggc 1020 
tccactagta tcgtgtacta cgactccgac gagatgatct tcgtccgcaa cggctacggc 1080 
tccaagcccg gcctgatcac gtacatcaac ctgggctcct ccaaggtggg ccgctgggtg 1140 
tacgtcccga agttcgccgg cgcgtgcatc cacgagtaca ccggcaacct cggcggctgg 1200 
gtggacaagt acgtgtactc ctccggctgg gtctacctgg aggccccggc ctacgacccc 1260 
gccaacggcc agtacggcta ctccgtgtgg tcctactgcg gcgtcggcac atcgattgct 1320 
ggcatcctcg aggccgacag ggtcctcacc gtcagcccct actacgccga ggagctcatc 1380 
tccggcatcg ccaggggctg cgagctcgac aacatcatgc gcctcaccgg catcaccggc 1440 
atcgtcaacg gcatggacgt cagcgagtgg gacc'ccagca gggacaagta catcgccgtg 1500 
aagtacgacg tgtcgacggc cgtggaggcc aaggcgctga acaaggaggc gctgcaggcg 1560 
gaggtcgggc tcccggtgga ccggaacatc ccgctggtgg cgttcatcgg caggctggaa 1620 
gagcagaagg gccccgacgt catggcggcc gccatcccgc agctcatgga gatggtggag 1680 
gacgtgcaga tcgttctgct gggcacgggc aagaagaagt tcgagcgcat gctcatgagc 1740 
gccgaggaga agttcccagg caaggtgcgc gccgtggtca agttcaacgc ggcgctggcg 1800 
caccacatca tggccggcgc cgacgtgctc gccgtcacca gccgcttcga gccctgcggc 1860 
ctcatccagc tgcaggggat gcgatacgga acgccctgcg cctgcgcgtc caccggtgga 1920 
ctcgtcgaca ccatcatcga aggcaagacc gggttccaca tgggccgcct cagcgtcgac 1980 
tgcaacgtcg tggagccggc ggacgtcaag aaggtggcca ccaccttgca gcgcgccatc 2040 
aaggtggtcg gcacgccggc gtacgaggag atggtgagga actgcatgat ccaggatctc 2100 
tcctggaagg gccctgccaa gaactgggag aacgtgctgc tcagcctcgg ggtcgccggc 2160 
ggcgagccag gggttgaagg cgaggagatc gcgccgctcg ccaaggagaa cgtggccgcg 2220 
ccc 2223 



<210> 10 

<211> 741 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 10 
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580 585 590 

Val Lys Phe Asn Ala Ala Leu Ala His His He Met Ala Gly Ala Asp 

595 600 605 

Val Leu Ala Val Thr Ser Arg Phe Glu Pro Cys Gly Leu He Gin Leu 

610 615 620 

Gin Gly Met Arg Tyr Gly Thr Pro Cys Ala Cys Ala Ser Thr Gly Gly 
625 630 635 640 

Leu Val Asp Thr He He Glu Gly Lys Thr Gly Phe His Met Gly Arg 

645 650 655 

Leu Ser Val Asp Cys Asn Val Val Glu Pro Ala Asp Val Lys Lys Val 

660 665 670 

Ala Thr Thr Leu Gin Arg Ala He Lys Val Val Gly Thr Pro Ala Tyr 

675 680 685 

Glu Glu Met Val Arg Asn Cys Met He Gin Asp Leu Ser Trp Lys Gly 

690 695 700 

Pro Ala Lys Asn Trp Glu Asn Val Leu Leu Ser Leu Gly Val Ala Gly 
705 710 715 720 

Gly Glu Pro Gly Val Glu Gly Glu Glu He Ala Pro Leu Ala Lys Glu 

725 730 735 

Asn Val Ala Ala Pro 

740 



<210> 11 
<211> 1515 
<212> DNA 
<213> Zea mays 

<400> 11 

ggagagctat gagacgtatg tcctcaaagc cactttgcat tgtgtgaaac caatatcgat 60 
ctttgttact tcatcatgca tgaacatttg tggaaactac tagcttacaa gcattagtga 120 
cagctcagaa aaaagttatc tatgaaaggt ttcatgtgta ccgtgggaaa tgagaaatgt 180 
tgccaactca aacaccttca atatgttgtt tgcaggcaaa ctcttctgga agaaaggtgt 240 
ctaaaactat gaacgggtta cagaaaggta taaaccacgg ctgtgcattt tggaagtatc 300 
atctatagat gtctgttgag gggaaagccg tacgccaacg ttatttactc agaaacagct 360 
tcaacacaca gttgtctgct ttatgatggc atctccaccc aggcacccac catcacctat 420 
ctctcgtgcc tgtttatttt cttgcccttt ctgatcataa aaaaacatta agagtttgca 480 
aacatgcata ggcatatcaa tatgctcatt tattaatttg ctagcagatc atcttcctac 540 
tctttacttt atttattgtt tgaaaaatat gtcctgcacc tagggagctc gtatacagta 600 
ccaatgcatc ttcattaaat gtgaatttca gaaaggaagt aggaacctat gagagtattt 660 
ttcaaaatta attagcggct tctattatgt ttatagcaaa ggccaagggc aaaattggaa 720 
cactaatgat ggttggttgc atgagtctgt cgattacttg caagaaatgt gaacctttgt 780 
ttctgtgcgt gggcataaaa caaacagctt ctagcctctt ttacggtact tgcacttgca 840 
agaaatgtga actccttttc atttctgtat gtggacataa tgccaaagca tccaggcttt 900 
ttcatggttg ttgatgtctt tacacagttc atctccacca gtatgccctc ctcatactct 960 
atataaacac atcaacagca tcgcaattag ccacaagatc acttcgggag gcaagtgcga 1020 
tttcgatctc gcagccacct ttttttgttc tgttgtaagt ataccttccc ttaccatctt 1080 
tatctgttag tttaatttgt aattgggaag tattagtgga aagaggatga gatgctatca 1140 
tctatgtact ctgcaaatgc atctgacgtt atatgggctg cttcatataa tttgaattgc 1200 
tccattcttg ccgacaatat attgcaaggt atatgcctag ttccatcaaa agttctgttt 1260 
tttcattcta aaagcatttt agtggcacac aatttttgtc catgagggaa aggaaatctg 1320 
ttttggttac tttgcttgag gtgcattctt catatgtcca gttttatgga agtaataaac 1380 
ttcagtttgg tcataagatg tcatattaaa gggcaaacat atattcaatg ttcaattcat 1440 
cgtaaatgtt ccctttttgt aaaagattgc atactcattt atttgagttg caggtgtatc 1500 

180 



WO 2005/096804 



PCT/US2004/007182 



tagtagttgg aggag 1515 

<210> 12 

<211> 673 

<212> DNA 

<213> Zea mays 

<400> 12 

gatcatccag gtgcaaccgt ataagtccta aagtggtgag gaacacgaaa caaccatgca 60 

ttggcatgta aagctccaag aatttgttgt atccttaaca actcacagaa catcaaccaa 120 

aattgcacgt caagggtatt gggtaagaaa caatcaaaca aatcctctct gtgtgcaaag 180 

aaacacggtg agtcatgccg agatcatact catctgatat acatgcttac agctcacaag 240 

acattacaaa caactcatat tgcattacaa agatcgtttc atgaaaaata aaataggccg 300 

gacaggacaa aaatccttga cgtgtaaagt aaatttacaa caaaaaaaaa gccatatgtc 360 

aagctaaatc taattcgttt tacgtagatc aacaacctgt agaaggcaac aaaactgagc 420 

cacgcagaag tacagaatga ttccagatga accatcgacg tgctacgtaa agagagtgac 480 

gagtcatata catttggcaa gaaaccatga agctgcctac agccgtctcg gtggcataag 540 

aacacaagaa attgtgttaa ttaatcaaag ctataaataa cgctcgcatg cctgtgcact 600 

tctccatcac caccactggg tcttcagacc attagcttta tctactccag agcgcagaag 660 

aacccgatcg aca 673 

<210> 13 
<211> 454 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 13 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met 

20 25 30 

Gin Ala Phe Tyr Trp Asp Val Pro Ser Gly Gly He Trp Trp Asp Thr 

35 40 45 

He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala lie 

50 55 60 

Trp He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly 
65 70 75 80 

Tyr Asp Pro Tyr Asp Tyr Phe Asp Leu Gly Glu Tyr Tyr Gin Lys Gly 

85 90 95 

Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu lie Asn Met lie 

100 105 110 

Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He 

115 120 125 

Asn His Arg Ala Gly Gly Asp Leu Glu Trp Asn Pro Phe Val Gly Asp 

130 135 140 

Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala 
145 150 155 160 

Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly 

165 170 175 

Thr Phe Gly Gly Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin 

180 185 190 
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Thr 


Gly Asn 


Leu 


Gly 








405 










410 










415 




Gly 


Trp 


Val 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 








420 










425 










430 






Ala 


Pro 


Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 


Gly Gin 


Tyr Gly Tyr Ser Val 


Trp 






435 










440 










445 








Ser 


Tyr 


Cys 


Gly Val 


Gly 
























450 































<210> 14 

<211> 460 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<400> 14 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu Leu Ala 


Leu 


Ala 


Ala 


Ser 


1 






5 










10 






15 




Ala 


Thr 


Ser 


Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu Glu Gly Gly Val 


He 


Met 








20 










25 






30 






Gin 


Ala 


Phe 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser Gly Gly He 


Trp 


Trp 


Asp 


Thr 






35 










40 






45 








lie 


Arg 


Gin 


Lys 


He 


Pro 


Glu 


Trp 


Tyr 


Asp Ala Gly 


He 


Ser 


Ala 


He 




50 










55 






60 










Trp 


He 


Pro 


Pro 


Ala 


Ser 


Lys 


Gly 


Met 


Ser Gly Gly 


Tyr 


Ser 


Met 


Gly 
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Tyr 


Asp 


Pro 


Tyr 


Thr 


val 


Glu 


Thr 








100 


Asn 


Thr 


Ala 


His 






115 




Asn 


_ _ ♦ 

HIS 


Arg 


Ala 




130 






Tyr 


Thr 


Trp 


Thr 


14 5 








Asn 


Tyr 


Leu 


Asp 


Thr 


Phe 


Gly 


Gly 








180 


Tyr 


Trp 


Leu 


Trp 






195 




Tie 


Gly 


lie 


Asp 




210 






Trp 


Val 


val 


Lys 


225 








Glu 


Tyr 


Trp 


Asp 


Ser 


Gly 


Ala 


Lys 








260 


Ala 


Phe 


Asp 


Asn 






275 




Gly 


Gly 


Thr 


Val 




290 






Ala 


Asn 


His 


Asp 


305 








Phe 


lie 


Leu 


Thr 


Glu 


Glu 


Trp 


Leu 








34 0 


Asp 


Asn 


Leu 


Ala 






355 




Glu 


Met 


lie 


Phe 




370 






Thr 


Tyr 


lie 


Asn 


385 








Pro 


Lys 


Phe 


Ala 


Gly 


Trp 


Val 


Asp 








420 


Ala 


Pro 


Ala 


Tyr 






435 




Ser 


Tyr 


Cys 


Gly 




450 







70 



Asp 


Tyr 


Phe 


Asp 


85 








Arg 


Phe Gly 


Ser 


Ala 


Tyr Gly 


He 








120 


Gly Gly Asp 


Leu 






135 




Asp 


Phe 


Ser 


Lys 




150 






Phe 


His 


Pro 


Asn 


165 








Tyr 


Pro 


Asp 


He 


Ala 


Ser 


Gin 


Glu 








200 


Ala 


Trp 


Arg 


Phe 






215 




Asp 


Trp 


Leu 


Asn 




230 






Thr 


Asn 


Val 


Asp 


245 








Val 


Phe 


Asp 


Phe 


Lys 


Asn 


lie 


Pro 








280 


Val 


Ser 


Arg 


Asp 






295 




Thr 


Asp 


lie 


He 




310 






Tyr 


Glu 


Gly 


Gin 


325 








Asn 


Lys 


Asp 


Lys 


Gly 


Gly 


Ser 


Thr 








360 


Val 


Arg 


Asn 


Gly 






375 




Leu 


Gly Ser 


Ser 




390 






Gly Ala 


Cys 


He 


405 








Lys 


Tyr 


Val 


Tyr 


Asp 


Pro 


Ala 


Asn 








440 


Val 


Gly 


ser 


Glu 






455 









7 5 




Leu Gly 


Mill 

GIU 


Tyr 




90 






uyb 




Glu 


Leu 


105 








Lys 


Val 


lie 


Aia 


uiu 


i rp 


Asn 


Pro 








140 


vai 


Aia 


Ser 


Gly 






155 




Glu 


Leu 


IT ' 

His 


Ala 




170 






Cys 


HIS 


Asp 


Lys 


185 








Ser 


Tyr 


Ala 


Ala 


Asp 


Tyr 


Val 


Lys 








220 


Trp 


Trp 


Gly 


Gly 






235 




Ala 


Leu 


Leu 


Asn 




*\ n 

2 50 






Pro 


Leu 


Tyr 


Tyr 


265 








Ala 


Leu 


Val 


Glu 


Pro 


Phe 


Lys 


Ala 








300 


Trp 


Asn 


T . • mm 

Lys 


Tyr 






315 




Pro 


Thr 


He 


Phe 




330 






Leu 


Lys 


Asn 


Leu 


345 








Ser 


He 


1 

vai 


Tyr 


Tyr Gly 


Ser 


Lys 








380 


Lys 


Val 


Gly 


Arg 






395 




His 


Glu 


Tyr 


Thr 




410 






Ser 


Ser 


Gly 


Trp 


425 








Gly Gin 


Tyr 


Gly 


Lys 


Asp 


Glu 


Leu 



Tyr 


Lrin 


bys 


Gly 










T 1 a 

lie 


Asn 


net 


i ie 




1 1 u 






Asp 


Ti- 
ne 


vai 


T 1 A 

lie 


1/3 








Pne 


val 


Gly 


Asp 


Lys 


Tyr 


Thr 


Ala 








lou 


Gly 


Asp 


Ser 


Gly 






1 / D 




Ser 


Trp 


Asp 


Gin 




190 






Tyr 


Leu 


Arg 


Ser 


205 








Gly 


Tyr 


Gly 


Ala 


Trp 


Ala 


Val 


Gly 








24 0 


Trp 


Ala 


Tyr 


Ser 






o c c 
2 55 




Lys 


Met 


Asp 


Ala 




2 70 






Ala 


T - - 

Leu 


Lys 


Asn 


O Q C 








Val 


Thr 


Pne 


ir ^ 1 

val 


Pro 


Ala 


Tyr 


Ala 








J z u 


Tyr 


Arg 


Asp 


Tyr 






"5 *> C 
J J D 




He 


Trp 


He 


HIS 




t r a 

j b 0 






Tyr 


Asp 


Ser 


ASp 


"1 /* CT 

365 








Pro 


Gly 


Leu 


He 


Trp 


Val 


Tyr 


Val 








400 


Gly 


Asn 


Leu 


Gly 






415 




Val 


Tyr 


Leu 


Glu 




430 






Tyr 


Ser 


Val 


Trp 



<210> 15 
<211> 518 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 15 



net 


Leu 


ft "] -» 

Aia 


Aia 




Ala 
Al a 


1 llL 


Ser 


Gin 


Leu 


Va 1 


Ala 


Thr Arg Ala Gly 


1 








b 










10 










15 




Leu 


uiy 


va i 


pro 


ASp 


Ala 


C a y 


Thr 


Phe 


Arg 


a y n 
r\L y 


Gly Ala 


Ala 


Gin Gly 








2 0 










25 










30 






Leu 


Arg 


Giy 

3 5 


Aia 


Arg 


Aia 


O O V 

bcr 


Ala 
40 


Ala 


Ala 


Ann 
Mop 


Thr 


Leu 

45 


Ser 


Met 


Arg 


Thr 


Ser 
50 


Ala 


Arg 


Aia 


Aia 


T~} v- /"X 

r ro 

55 


Arg 


His 


Gin 


fl X a 


Gin 
60 


Gin 


Ala 


Arg 


Arg 


Gly 


Ala 


Arg 


Phe 


Pro 


Ser 


Leu 


Val 


Val 


Cys 


Ala 


Ser 


Ala 


Gly Ala 


Met 


65 










70 










"7 C 
f D 










80 


Ala 


Lys 


Tyr 


Leu 


Cj 1 u 


Leu 


UlU 


Glu 


Gly Gly 


\/a 1 


He 


Met 


Gin 


Ala 


Phe 










85 










90 










95 




Tyr 


Trp 


Asp 


Val 
100 


Pro 


Ser 


Gly 


Gly 


He 
105 


Trp 


Trp 


Asp 


Thr 


He 
110 


Arg 


Gin 


Lys 


He 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala Gly 


He 


Ser 


Ala 


He 


Trp 


He 


Pro 






115 










120 










125 








Pro 


Ala 


Ser 


Lys 


Gly 


Met 


Ser 


Gly Gly Tyr 


Ser 


Met 


Gly Tyr Asp 


Pro 




130 










135 










140 










Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys Gly Thr Val Glu 


145 










150 










155 










160 


Thr 


Arg 


Pne 


Gly 


ser 


Lys 


oin 


Glu 


Leu 


He 


Asn 


Met 


lie 


Asn 


Thr 


Ala 








165 










170 










175 




His 


Ala 


Tyr 


Gly 
180 


Tie. 

lie 


Lys 


va i 


He 


Ala 
185 


Asp 


He 


Val 


lie 


Asn 
190 


His 


Arg 


Ala 


Gly 


Gly 


Asp 


T At! 

Leu 


Glu 


Trp 


Asn 


Pro 


Phe 


Val 


Gly Asp Tyr 


Thr 


Trp 






195 










200 










205 








Thr 


Asp 
210 


Pne 


Ser 


Lys 


va i 


ft 1 -J 

Aia 

215 


Ser 


Gly 


Lys 


Tyr 


Thr 
220 


Ala 


Asn 


Tyr 


Leu 


Asp 


Phe 


HIS 


Pro 


Asn 


UlU 


Leu 


His 


Ala 


Gly Asp 


Ser 


Gly 


Thr 


Phe 


Gly 


2 25 










1 *s r\ 

2 3 0 










235 










240 


Gly 


Tyr 


Pro 


Asp 


He 

O A C 


Cys 


His 


Asp 


Lys 


Ser 
250 


Trp 


Asp 


Gin 


Tyr 


Trp 
255 


Leu 


Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


He 


Gly 


He 






2 60 










265 










270 






Asp 


Ala 


Trp 


Arg 


Phe 


Asp 


Tyr 


val 


Lys Gly Tyr Gly Ala Trp 


Val 


Val 






275 










280 










285 








Lys 


Asp 


Trp 


Leu 


Asn 


Trp 


Trp 


Gly Gly Trp 


Ala 


Val 


Gly Glu 


Tyr 


Trp 




290 










295 










300 










Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


Ala 


305 










310 










315 










320 


Lys 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 


Asp 








325 










330 










335 




Asn 


Lys 


Asn 


He 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn Gly Gly Thr 






340 










345 










350 






Val 


Val 


Ser 
355 


Arg 


Asp 


Pro 


Phe 


Lys 
360 


Ala 


Val 


Thr 


Phe 


Val 
365 


Ala 


Asn 


His 


Asp 


Thr 
370 


Asp 


He 


He 


Trp 


Asn 
375 


Lys 


Tyr 


Pro 


Ala 


Tyr 
380 


Ala 


Phe 


He 


Leu 
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Thr Tyr Glu Gly 
385 

Leu Asn Lys Asp 

Ala Gly Gly Ser 

420 

Phe Val Arg Asn 
435 

Asn Leu Gly Ser 
450 

Ala Gly Ala Cys 
465 

Asp Lys Tyr Val 

Tyr Asp Pro Ala 

500 

Gly Val Gly Thr 
515 



<210> 16 

<211> 820 

<212> PRT 

<213> Artificia 

<220> 

<223> synthetic 



<400> 16 



Met 


Leu 


Ala 


Ala 


1 








Leu Gly 


Val 


Pro 








20 


Leu 


Arg 


Gly 


Ala 






35 




Thr 


Ser 


Ala 


Arg 




50 






Gly Ala 


Arg 


Phe 


65 








Ala 


Lys 


Tyr 


Leu 


Tyr 


Trp 


Asp 


Val 








100 


Lys 


lie 


Pro 


Glu 






115 




Pro 


Ala 


Ser 


Lys 




130 






Tyr 


Asp 


Tyr 


Phe 


14 5 








Thr 


Arg 


Phe 


Gly 


His 


Ala 


Tyr 


Gly 








180 


Ala Gly 


Gly 


Asp 



Gin 

NJ ill 


Pro 


Thr 


He 




J <7 \J 








l—t 14 


Lys 


Asn 


4 0 S 








Thr 


Se r 


He 


Val 


Pi v 




Gly Ser 








440 


Ser 


Lys 


val 


Gly 






455 




lie 


His 


Glu 


Tyr 




470 






Tyr 


Ser 


Ser 


Gly 


485 








Asn 


Gly 


Gin 


Tyr 



Ser He 



Sequence 



Leu Ala Thr Ser 

5 

Asp Ala Ser Thr 

Arg Ala Ser Ala 

40 

Ala Ala Pro Arg 
55 

Pro Ser Leu Val 
70 

Glu Leu Glu Glu 
85 

Pro Ser Gly Gly 

Trp Tyr Asp Ala 

120 

Gly Met Ser Gly 
135 

Asp Leu Gly Glu 
150 

Ser Lys Gin Glu 
165 

He Lys Val lie 
Leu Glu Trp Asn 



Phe 

rue 


Tvr 
i y l 


Aro 


Asp 












T 1 e 
lie 


Tr~o 


He 




din 






Tur 
iy l 


Tvr 


Asd 


Ser 












Pro 


Glv 


Leu 


Arg 


Trp 


Val 


Tyr 








460 


Thr 


Gly 


Asn 


Leu 






475 




Trp 


Val 


Tyr 


Leu 




490 






Gly 


Tyr 


Ser 


Val 


505 










Toil 
IjCU 


Va 1 


Al a 




1 ft 
1 U 






php 

r lie 


A t*ci 




Glv 










Ala 


Ala 

old 


Sen 


Thr 


His 


Gin 


His 


Gin 








60 


v a i 


y © 


Al a 


O C J. 






*7 ^ 




\j±y 


gi v 


Val 

V U 1 


He 

X X C 










Tip 


irp 


irp 


Ann 
no 


i n^ 








Gl v 


He 

J. X 


5?e r 


Ala 


Glv 


Tvr 


Ser 


Met 








140 


Tyr 


Tyr 


Gin 


Lys 






155 




Leu 


He 


Asn 


Met 




170 






Ala 


Asp 


He 


Val 


185 








Pro 


Phe 


Val 


Gly 




185 





Tyr 


Glu 


Glu 


Trp 








400 


His 


Asd 


Asn 


Leu 






415 

1 1 J 






Gl u 

Ul u 


Met 


He 

11C 




4 30 






lie 


Thr 

All lr 


Tvr 


lie 


445 








Val 


Pro 


Lys 


Phe 


Gly 


Gly 


Trp 


Val 








480 


Glu 


Ala 


Pro 


Ala 






495 




Trp 


Ser 


Tyr 


Cys 




510. 






I ill 


A r*rr 


Al a 

r\l a 


Glv 






i 

1 3 




A 1 ^ 
Mid 


Al a 
Mia 


Gl n 

will 


Glv 




n 








Cor 

o C 1 


Met 


Ara 










Gin 


Ala 


X ^ 


Aira 

i» x y 


Ala 


Gly 


Ala 


Met 








80 


Met 

l ie v 


Gin 


Ala 


Phe 






7 3 




J. Ill 


Tie 

11C 




Gin 




110 
1 1« 






lie 

X x w 


Trn 


He 


Pro 


1 ?5 

X L, ~> 








Glv 


Tvr 


Aso 


Pro 


Gly 


Thr 


Val 


Glu 








160 


He 


Asn 


Thr 


Ala 






175 




He 


Asn 


His 


Arg 




190 






Asp 


Tyr 


Thr 


Trp 
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Thr 


Asp 


pne 


P /T> y 

ber 




210 






Asp 


Phe 


HIS 


rro 


225 








Gly Tyr 


pro 


ASp 


Trp 


Ala 


Ser 


Gin 








"5 £ A 


Asp 


Ala 


Trp 


Arg 






*5 *7 C 




Lys 


Asp 


Trp 


Leu 




290 






Asp 


Thr 


Asn 


Val 


305 








Lys 


Val 


Pne 


ASp 


Asn 


Lys 


Asn 


lie 








"i A A 

3 4 0 


Val 


Val 


Ser 


Arg 






3 55 




Asp 


Tnr 


Asp 


He 




370 






Thr 


Tyr 


Glu 


Gly 


385 








Leu 


Asn 


Lys 


ASp 


Ala 


Gly 


Gly 


Ser 








A O A 

4 Z U 


Phe 


Val 


Arg 


Asn 






4 J 5 




Asn 


Leu 


Gly 


Ser 




450 






Ala 


Gly 


Ala 


cys 


465 








Asp 


Lys 


Tyr 


va i 


Tyr 


Asp 


Pro 


Ala 








C A A 


Gly 


Val 


Gly 


i nr 






C 1 c 




Thr 


Val 


f*\ W I >' 

ber 


pro 




530 






Gly Cys 


OIU 


LicU 


545 








Val 


Asn 


Gly 


Met 


He 


Ala 


Val 


Lys 








580 


Asn 


Lys 


Glu 


Ala 






595 




lie 


Pro 


Leu 


Val 




610 






Asp 


Val 


Met 


Ala 



T A/ c 

by © 


\/a 1 
val 


Ala 


Qav 
OCi 






-L- *J 




Ren 
ASH 


Vj 1 U 




HIS 




? 3 0 

A. J V 






Tip 


cys 


nis 




c. 4 D 








uiu 


C o »- 


iyr 


Aid 


Phe 


Asp 


Tyr 


Val 








■? ft n 

£, OK) 


Asn 


Trp 


Trp 


\3 1 y 






o q 5 
^ ^ -j 




ASp 


ai a 


L>eu 


T Alt 

LicU 




3 1 U 






nun 

rne 


D "r~ /~\ 


Lieu 


iyr 


1 ■? g> 

J £. Z> 








pro 


a i a 


j_»eu 


va i 


Asp 


Pro 


Phe 


Lys 








7 n 

O D v 


He 


Trp 


Asn 


by s 






j / D 




pi _ 
Gin 


"V y"v 

pro 


i nr 


lie 




■J Q A 






Lys 


Leu 


Lys 


Asn 


A A C 








Tnr 


-v-- 

ber 


Tin 

lie 


va l 


Gly 


Tyr 


Gly 


Ser 








A A A 


Ser 


Lys 


va i 


l?i y 










Tl a 

lie 


MIS 


ulU 


Tyr 




d "7 A 






ryr 


C o r* 


Coy 


ui y 


A ft c: 
t O D 








ft O 1^ 

asu 


pit, 

\j i y 


m n 
win 


iyr 


Ser 


He 


Ala 


Gly 








c in 

_> £. u 


iyr 


iyr 


Ala 


OIU 






5 5 5 
j j j 




ASp 


Ren 

as n 


lie 


rlc L 




550 






Asp 


Val 


Ser 


Glu 


565 








Tyr 


Asp 


Val 


Ser 


Leu 


Gin 


Ala 


Glu 








600 


Ala 


Phe 


He 


Gly 






615 




Ala 


Ala 


He 


Pro 



Gly 


Lys 


Tyr 


Thr 








220 


Mid 




Asn 


Ser 






235 




T a/ a 
by o 


Coy 


1 x F 






2 50 

C «^ v 






rvx d 


iyr 


LICU 


A TO 
fix. 










Liy s 


o i y 


TS/ y 

iyr 


filw 


Gly 


Trp 


Ala 


Val 








300 


A r n 

Ab II 


i rp 


Ala 

Mid 


iyr 






315 
j j. j 




iyr 


T A/ c 








3 3 0 

J J V 








nla 


T .P 1 1 












A l a 

Al a 


V d 1 


i nr 




Tyr 


Pro 


Ala 


Tyr 








380 

V w w 


rflc 


iyr 


7\ yrt 

Arg 


Ben 
ASp 






3 Q5 




LicU 


Tip 
11C 


irp 


Tip 
lit; 




4 10 






iyr 


TV / y 

iyr 


AS p 




4 2 5 

T i. J 








uy s 


fro 


n 1 \/ 
uiy 


T .on 


Arg 


Trp 


Val 


Tyr 








4 0 


inr 


\ji y 


Mo 1 1 


T .on 






475 




i rp 


v a x 


iy i 


T.pii 




490 

^ w 






u 1 V 


iyr 


Car 


Val 


505 

J u J 








Tie 




Glu 


Ala 


Glu 


Leu 


lie 


Ser 








540 




U w u 


Thi" 
i in 


Glv 






rcc 




Trp 


Asp 


Pro 


Ser 




570 






Thr 


Ala 


Val 


Glu 


585 








val 


Gly 


Leu 


Pro 


Arg 


Leu 


Glu 


Glu 








620 


Gin 


Leu 


Met 


Glu 



186 



Ala 




Tvr 




Gly 


Thr 


Phe 


Gly 








240 


Gl n 

will 


TV/ y 

iyr 










255 
j j 




Cor* 
OCX 


T1p 

IXC 




Tip 

X X c 




270 






Al a 
MX a 


Trn 

i rp 


1 

V CI X 


Va 1 

V CI X 


t O J 








vji y 


m ii 

ui U 


Twr 
iyr 


i rp 


Ser 


Ser 


Gly 


Ala 








320 

9m V 


Ala 
Mid 


Ala 
Mid 


Php 
flic 


A a t\ 
MS p 






335 




Ann 

Ho 11 


vjiy 


o x y 


Th v 
1 111 




550 






V d 1 


Ala 
Aid 


Asn 


n 1 o 


JO? 








Ala 
Aid 




Tip 

lie 


T .o 1 1 
Lie U 


Tyr 


Glu 


Glu 


Trp 








4 00 

1 V N/ 


MIS 


ASp 


Asn 


JjcU 






4 1 J 




MSp 




Mot* 


Tip 
lie 




410 






T 1 o 
11c 


Thr 

i nr 


T^\ / y 

iyr 


T1p 
lie 


4 4 5 

*t 1 Li 








\fa 1 
Val 


D y ^> 


Liy ci 


Php 
rile 


Gly 


Gly 


Trp 


Val 








4 80 

i U V 


U i. u 


Al a 

/A X CI 


Pro 


Ala 






495 




irp 


Cpr 


Tvr 
i yi 






510 








A TO 

Mxy 


Va 1 


Leu 

w W 


525 








Glv 
uxy 


He 

X X ^ 


Ala 


Ara 


He 


Thr 


Gly 


He 








560 

J V V 


Arg 


Asp 


Lys 


Tyr 






575 




Ala 


Lys 


Ala 


Leu 




590 






Val 


Asp 


Arg 


Asn 


605 








Gin 


Lys 


Gly 


Pro 


Met 


Val 


Glu 


Asp 
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£ 0 c. 

0 C D 










630 










635 










640 


Val 


Gin 


lie 


Va 1 


Leu 

V* 


Leu 


Gly Thr Gly Lys 


Lys 


Lys 


Phe 


Glu 


Arg 


Met 










U T «^ 










650 










655 




Leu 


Met 


J. 


Ala 


Glu 


Glu 


Lys 


Phe 


Pro Gly 


Lys 


Val 


Arg 


Ala 


Val 


Val 








6 £ 










665 










670 






Lys 


Phe 


Asn 


Ala 


Ala 


Leu 


Ala 


His 


His 


lie 


Met 


Ala Gly Ala Asp Val 




c *7 c; 










680 

\J V V 










685 








Leu 


Ala 




Th T" 


Cor 

C i- 




Phe 


Glu 


Pro Cys 


Gly 


Leu 


He 


Gin 


Leu 


Gin 




690 










o u 










700 










Gly Met 


Arg 


Tyr 


Gly 


Thr 




Cvs 


Ala 


Cys 


Ala 


Ser 


Thr Gly Gly 


Leu 


705 










Tin 

/ 1 V 










715 










720 


Val 


Asp 


ini 


Tie 
lie 


Tip 
lie 


VJ1U 


\J 1 jr 


l.VC 

u y w 


Thr Gly 


Phe 


His 


Met 


Gly 


Arg 


Leu 


















730 










735 




Ser 


Val 


Asp 


"7 4 0 


nail 


Va 1 
Val 


Val 


Glu 


Pro 

745 


Ala 


Asp 


Val 


Lys 


Lys 
750 


Val 


Ala 


Thr 


Thr 


Leu 


Gin 


Arg 


Ala 


lie 


Lys 


Val 


Val 


Gly Thr 


Pro 


Ala 


Tyr 


Glu 






"7 C C 

7 5 5 










760 










765 








Glu 


Met 
770 


Val 


Arg 


Asn 


Cys 


Met 
775 


lie 


Gin 


Asp 


Leu 


Ser 
780 


Trp 


Lys 


Gly 


Pro 


Ala 


Lys 


Asn 


Trp 


Glu 


Asn 


Val 


Leu 


Leu 


Ser 


Leu 


Gly 


Val 


Ala 


Gly 


Gly 


785 








790 










795 










800 


Glu 


Pro 


Gly 


Val 


Glu 


Gly 


Glu 


Glu 


lie 


Ala 


Pro 


Leu 


Ala 


Lys 


Glu 


Asn 








805 










810 










815 




Val 


Ala 


Ala 


Pro 
820 



























<210> 17 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 17 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 

Ala Thr Ser 



<210> 18 
<23 1> 444 
<212> PRT 
<213> Thermotoga 

<400> 18 
Met Ala Glu Phe 
1 

Glu Ser Thr Asn 

20 

He Asp Gly Lys 
35 



marit ima 



Phe Pro Glu He 
5 

Pro Leu Ala Phe 

Pro Leu Lys Asp 

40 



Pro Lys He Gin 
10 

Arg Phe Tyr Asp 
25 

His Leu Lys Phe 
187 



Phe Glu Gly Lys 
15 

Pro Asn Glu Val 
30 

Ser Val Ala Phe 
45 
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T y-o 
I i Jy 


Hie 
nib 


l nr 


Php 

r lie 




j U 






Ala 

Ala 


VJ i U 


Mrg 


Pro 
r x w 










Mia 


A 

nty 


Vdl 


Son 


Tyr 


Phe 


Cys 


Phe 








100 


Arg 


p l ii 


i n r 


A a A 
Moll 






TIC 

lib 




Hi g 


rIC L 




A c n 
nb[J 




1 T A 






trie 


O ^ v 

be r 


MIS 


D v" /"v 


14 5 








ASp 


Val 


rile 


Mia 


Thr 


Lys 


Glu 


Leu 








180 


oiy 


iyr 


ulU 


Th v 

i nr 






T A C 

195 




Leu 




Arg 


Pne 




2 10 






Pne 


i nr 


oiy 


ui n 


225 








His 


Gin 


Tyr 


ASp 


His 


Gly 


Leu 


Asp 








260 


Thr 


Leu 


Ai a 


kj i y 






2 75 




Leu 


Gly 


Lys 


Leu 




290 






Gly Trp 


hen 
nap 


Th v 
lilt 


305 








Ala 


Met 


ryr 




Asn 


Phe 


Asp 


Ala 








340 


Phe 


He 


biy 


nio 






355 




lie 


Ala 


Tyr 


Lys 




370 






Glu 


Lys 


Tyr 


Arg 


385 








Gly 


Lys 


Thr 


Asp 


Asp 


lie 


Glu 


Leu 



420 

Asn Ser Tyr He 
435 



<210> 19 
<211> 1335 



Val Asn Glu Gly 
55 

Trp Asn Arg Phe 
70 



Ala Leu Phe Glu 
85 



Ui e 

nio 


M o u 


mi y 




jjy s 


Tip 

lie 


1_*C u 


Mo U 








1 £ V 


Cor* 


Sen 
Mali 


VOX 


T .\/Q 










Arg 


Tyr 


Met 


His 




150 






Tyr 


Ala 


Ala 

Aia 


Ala 

Aia 


1 f c 

16 5 








p i w 

U i y 


p i \i 
biy 


p. 1 1 1 

VjlU 




Leu 


Leu 


Asn 


*"P V* V 

i nr 








>~) A A 


T ... 

Leu 


Arg 


roe c 


Ala 

mi a 






z lb 




Phe 


Leu 


He 


Glu 




230 






Pne 


ASp 


va i 


Ala 

Aia 


24 5 








pi ,, 
CjIU 


Tyr 


File 


by s 


HIS 


i nr 


rile 


pin 








~> Q A 
£ 0 U 


oiy 


be r 


lie 


A c r"\ 
Mfap 






z y b 




Asp 


Gin 


Phe 


Pro 




310 






vai 


Tip 

lie 


ijy s 


Ala 
Ml a 


325 








Lys 


Val 


Arg 


Arg 


lie 


Ala 


Gly 


Met 








360 


Leu 


Ala 


Lys 


Asp 






375 




Ser 


Phe 


Lys 


Glu 




390 






Phe 


Glu 


Lys 


Leu 


405 








Pro 


Ser 


Gly 


Lys 


Val 


Lys 


Thr 


He 








440 



Arg 


Asp 


Pro 


Phe 








60 


^p r 




Pro 


Met 










Phe 


Cvs 


Glu 


Lvs 




90 






He 

x *x w 


Ala 


Pro 


Glu 


lu J 








Lvs 


Val 


Val 


Glu 


Leu 


Leu 


Trp 


Gly 








140 


vj i y 


Mia 


Ala 

Mid 


Thr 

1X11 






1 cc 




Gin 


Va 1 

v a x 


jj y o 


Lvs 




1 / U 






iyr 


vdl 


r 11 C 


Trn 


lab 








MSp 




wty 


r iPi i 


Val 


Glu 


Tyr 


Ala 








220 


Fro 


bys 


D v /"s 
rlO 


jjy a 






/ J 5 




i nr 


Ala 
Ala. 


Tyr 


Mia 




*i C A 

£ bO 






r lie 


no 1 1 


Tie 

IXC 


Glu 

vJlU 


£ o b 








His 

Ii X o 




Lpu 

U \-l 


Ara 


Ala 


Asn 


Gin 


Gly 








300 


Thr 




He 

X -L. C 


Tvr 






J 1 3 




Glv 


Glv 
vjiy 


Phe 

Lit W 


Thr 




1 1 n 

J J V 






Ml a 


OC 1 


iyr 


xj y a 


1 A c 








Ron 
Mop 


Thr 


Php 

"lie 


Ala 


o j. y 


Val 
vox 


Php 

rue 


Asn 








380 


Gly 


He 


Gly 


Lys 






395 




Glu 


Glu 


Tyr 


He 




410 






Gin 


Glu 


Tyr 


Leu 


425 








Ala 


Glu 


Leu 


Arg 



188 



Gly Asp Pro Thr 

Asp Lys Ala Phe 

80 

Leu Asn He Glu 
95 

Gly Lys Thr Leu 
110 

Arg He Lys Glu 
125 

Thr Ala Asn Leu 

Thr Cys Ser Ala 

160 

Ala Leu Glu He 
175 

Gly Gly Arg Glu 
190 

Glu Leu Glu Asn 
205 

Lys Lys He Gly 

Glu Pro Thr Lys 

240 

Phe Leu Lys Asn 
255 

Ala Asn His Ala 
270 

Met Ala Arg He 
285 

Asp Leu Leu Leu 

Asp Thr Thr Leu 

320 

Lys Gly Gly Leu 
335 

Val Glu Asp Leu 
350 

Leu Gly Phe Lys 
365 

Lys Phe He Glu 

Glu He Val Glu 

400 

He Asp Lys Glu 
415 

Glu Ser Leu Leu 
430 
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<212> DNA 

<213> Thermotoga maritima 



<400> 19 

atggccgagt tcttcccgga gatcccgaag atccagttcg agggcaagga gtccaccaac 60 

ccgctcgcct tccgcttcta cgacccgaac gaggtgatcg acggcaagcc gctcaaggac 120 

cacctcaagt tctccgtggc cttctggcac accttcgtga acgagggccg cgacccgttc 180 

ggcgacccga ccgccgagcg cccgtggaac cgcttctccg acccgatgga caaggccttc 240 

gcccgcgtgg acgccctctt cgagttctgc gagaagctca acatcgagta cttctgcttc 300 

cacgaccgcg acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 

aaggtggtgg agcgcatcaa ggagcgcatg aaggactcca acgtgaagct cctctggggc 420 

accgccaacc tcttctccca cccgcgctac atgcacggcg ccgccaccac ctgctccgcc 480 

gacgtgttcg cctacgccgc cgcccaggtg aagaaggccc tggagatcac caaggagctg 540 

ggcggcgagg gctacgtgtt ctggggcggc cgcgagggct acgagaccct cctcaacacc 600 

gacctcggcc tggagctgga gaacctcgcc cgcttcctcc gcatggccgt ggagtacgcc 660 

aagaagatcg gcttcaccgg ccagttcctc atcgagccga agccgaagga gccgaccaag 720 

caccagtacg acctcgacgt ggccaccgcc tacgccttcc tcaagaacca cggcctcgac 780 

gagtacttca agttcaacat cgaggccaac cacgccaccc tcgccggcca caccttccag 840 

cacgagctgc gcatggcccg catcctcggc aagctcggct ccatcgacgc caaccagggc 900 

gacctcctcc tcggctggga caccgaccag ttcccgacca acatctacga caccaccctc 960 

gccatgtacg aggtgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 

aaggtgcgcc gcgcctccta caaggtggag gacctcttca tcggccacat cgccggcatg 1080 

gacaccttcg ccctcggctt caagatcgcc tacaagctcg ccaaggacgg cgtgttcgac 1140 

aagttcatcg aggagaagta ccgctccttc aaggagggca tcggcaagga gatcgtggag 1200 

ggcaagaccg acttcgagaa gctggaggag tacatcatcg acaaggagga catcgagctg 1260 

ccgtccggca agcaggagta cctggagtcc ctcctcaact cctacatcgt gaagaccatc 1320 

gccgagctgc gctga 1335 



<210> 20 

<211> 444 

<212> PRT 

<213> Thermotoga neapolitana 



<400> 20 




Met 


Ala 


Glu 


Phe 


1 








Glu 


Ser 


Thr 


Asn 








20 


lie 


Asp 


Gly 


Lys 






35 




Trp 


His 


Thr 


Phe 




50 






Ala 


Asp 


Arg 


Pro 


65 








Ala 


Arg 


val 


Asp 


Tyr 


Phe 


Cys 


Phe 








100 


Arg 


Glu 


Thr 


Asn 






115 




Arg 


Met 


Lys 


Asp 




130 






Phe 


Ser 


His 


Pro 


145 









Phe 


Pro 


Glu 


He 


5 








Pro 


Leu 


Ala 


Phe 


Pro 


Leu 


Lys 


Asp 








40 


Val 


Asn 


Glu 


Gly 






55 




Trp 


Asn 


Arg 


Tyr 




70 






Ala 


Leu 


Phe 


Glu 


85 








His 


Asp 


Arg 


Asp 


Lys 


He 


Leu 


Asp 








120 


Ser 


Asn 


Val 


Lys 






135 




Arg 


Tyr 


Met 


His 




150 







Pro Lys Val Gin 
10 

Lys Phe Tyr Asp 
25 

His Leu Lys Phe 

Arg Asp Pro Phe 

60 

Thr Asp Pro Met 
75 

Phe Cys Glu Lys 
90 

He Ala Pro Glu 
105 

Lys Val Val Glu 

Leu Leu Trp Gly 

140 

Gly Ala Ala Thr 
155 

189 



Phe Glu Gly Lys 
15 

Pro Glu Glu He 
30 

Ser Val Ala Phe 
45 

Gly Asp Pro Thr 

Asp Lys Ala Phe 

80 

Leu Asn He Glu 
95 

Gly Lys Thr Leu 
110 

Arg He Lys Glu 
125 

Thr Ala Asn Leu 

Thr Cys Ser Ala 

160 
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Asp 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 










165 








Thr 


Lys 


Glu 


Leu Gly 


Gly Glu 


Gly 








180 










Gly Tyr Glu 


Thr 


Leu 


Leu 


Asn 


Thr 






195 










200 


Leu 


Ala 


Arg 


Phe 


Leu 


Arq 


Met 


Ala 




210 










215 




Phe Thr Gly 


Gin 


Phe 


Leu 


He 


Glu 


225 










230 






His 


Gin 


Tyr 


ASp 


Phe 


Asd 


Val 


Ala 










245 








His 


Gly 


Leu 


Asp 


Glu 


Tvr 


Phe 


Lvs 








260 










Thr 


Leu 


Ala 


Gly His 


Thr 


Php 


Gin 






275 










2 80 

A V v 


Leu 

V— V« 


Gly 


Lys 


Leu 


Gly 


W> W X 


He 


Asd 




290 










295 






Trp 


Asp 


Thr 


Asp 


Gin 


Phe 

nit 


Pro 


305 










310 






Ala 


Met 


Tyr 


Glu 


Val 


He 

X X 


Lvs 


Ala 










325 








Asn 


Phe 


Asp 


Ala 


Lys 


Val 

v a x 


Aro 


Arq 








340 










Phe 


lie Gly 


His 


He 


Ala 

hi a 


<ji y 


Met 






355 










360 

www 


Val 


Ala 


Tyr 


Lys 


Leu 


Val 


Lys 


Asd 




370 










375 




Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Arg 


Glu 


385 










390 






Gly 


Lys 


Val 


Asp 


Phe 


Glu 


Lys 


Leu 










405 








Thr 


lie 


Glu 


Leu 


Pro 


Ser 


Gly 


Lys 








420 










Asn 


Ser 


Tyr 


lie 


Val 


Lys 


Thr 


He 






435 










440 



Gin 


Val 


Lys 


Lys 


Ala 


Leu 


Glu 


He 




170 










175 




Tvr 


val 


Phe 


Trp 


Gly 


Gly 


Arq 


Glu 

™* WA 


185 










190 






Asd 


Leu 


Glv 


Phe 


Glu 


Leu 


Glu 


Asn 










205 

fc* W w 








Val 


ASD 


Tvr 


Ala 


LVS 


Arq 


He 


Glv 








220 

w b w 










Pro 


Lvs 


Pro 


Lvs 


Glu 


Pro 


Thr 


Lvs 

xv jr a 






23 5 

c» w w 7 










240 


Thr 


Ala 


Tvr 

A / X 


Ala 

■ • -A- w* 


Phe 


Leu 


Lvs 


Ser 




250 

fit W w 










2 5 5 
t j j 




Phe 


Asn 


He 


Glu 


Ala 


Asn 


His 


Ala 


265 

& \J J 










270 

€* r w 






His 


Glu 


Leu 


Ara 


Met 


Ala 


Arq 


He 










£ O J 








Ala 


Asn 


Gin 


Glv 


Asd 


Leu 

W W M 


Leu 

*w w w* 


Leu 








3 00 

J V V 










Thr 


Asn 


Val 


Tvr 

* jr x 


Asd 


Thr 


Thr 

>- * * -Xi 


Leu 






J X -J 










3^0 

J Cm V 


Glv 


Glv 


Phe 


Thr 


Lvs 


Glv 


Glv 

\JX / 


Leu 

*w W* 




3 3 0 

•J w V 










3 3 5 

j j j 




Ala 


Ser 


Tyr 


Lys 


Val 


Glu 


Asd 


Leu 


34 5 

V T V 










3 50 

■j *j \j 






Asd 


Thr 
tin 


Phe 


Ala 




Glv 
ux y 


Phe 

rue 


u y o 










365 








Glv 


Val 


Leu 


Asp 


Lvs 


Phe 


He 

4- J* W 


Glu 








380 










Gly 


He 


Gly Arg 


Asp 


He 


Val 


Glu 






395 










400 


Glu 


Glu 


Tyr 


He 


He 


Asp 


Lys 


Glu 




410 










415 




Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


He 


425 










430 






Leu 


Glu 


Leu 


Arg 











<210> 21 
<211> 1335 
<212> DNA 

<213> Thermotoga neapolitana 

<400> 21 

atggccgagt tcttcccgga gatcccgaag gtgcagttcg agggcaagga gtccaccaac 60 
ccgctcgcct tcaagttcta cgacccggag gagatcatcg acggcaagcc gctcaaggac 120 
cacctcaagt tctccgtggc cttctggcac accttcgtga aegagggecg cgacccgttc 180 
ggcgacccga ccgccgaccg cccgtggaac cgctacaccg acccgatgga caaggccttc 240 
gcccgcgtgg acgccctctt cgagttctgc gagaagctca acatcgagta cttctgcttc 300 
cacgaccgcg acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 
aaggtggtgg agegcatcaa ggagcgcatg aaggactcca aegtgaaget cctctggggc 420 
accgccaacc tcttctccca cccgcgctac atgcacggcg ccgccaccac ctgctccgcc 480 
gacgtgttcg cctacgccgc cgcccaggtg aagaaggece tggagatcac caaggagctg 54 0 
ggeggegagg gctacgtgtt ctggggcggc cgegaggget acgagaccct cctcaacacc 600 

190 
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gacctcggct tcgagctgga gaacctcgcc cgcttcctcc gcatggccgt ggactacgcc 660 

aagcgcatcg gcttcaccgg ccagttcctc atcgagccga agccgaagga gccgaccaag 720 

caccagtacg acttcgacgt ggccaccgcc tacgccttcc tcaagtccca cggcctcgac 780 

gagtacttca agttcaacat cgaggccaac cacgccaccc tcgccggcca caccttccag 840 

cacgagctgc gcatggcccg catcctcggc aagctcggcc ccatcgacgc caaccagggc 900 

gacctcctcc tcggctggga caccgaccag ttcccgacca acgtgtacga caccaccctc 960 

gccatgtacg aggtgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 

aaggtgcgcc gcgcctccta caaggtggag gacctcttca tcggccacat cgccggcatg 1080 

gacaccttcg ccctcggctt caaggtggcc tacaagctcg tgaaggacgg cgtgctcgac li40 

aagttcatcg aggagaagta ccgctccttc cgcgagggca tcggccgcga catcgtggag 1200 

ggcaaggtgg acttcgagaa gctggaggag tacatcatcg acaaggagac catcgagctg 1260 

ccgtccggca agcaggagta cctggagtcc ctcatcaact cctacatcgt gaagaccatc 1320 

ctggagctgc gctga 1335 

<210> 22 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic 

<400> 22 

agcgaattca tggcggctct ggccacgt 28 

<210> 23 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<400> 23 

agctaagctt cagggcgcgg ccacgttct 29 

<210> 24 

<211> 825 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 24 


























Met Arg 


val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala Ala 


Ser 


1 






5 










10 








15 




Ala Thr 


Ser 


Ala 


Gly 


His 


Trp 


Tyr 


Lys 


His 


Gin 


Arg 


Ala 


Tyr Gin 


Phe 






20 










25 










30 




Thr Gly 


Glu 


Asp 


Asp 


Phe 


Gly 


Lys 


Val 


Ala 


Val 


Val 


Lys 


Leu Pro 


Met 




35 










40 










45 






Asp Leu 


Thr 


Lys 


Val 


Gly 


He 


He 


Val 


Arg 


Leu 


Asn 


Glu 


Trp Gin 


Ala 


50 










55 










60 








Lys Asp 


Val 


Ala 


Lys 


Asp 


Arg 


Phe 


He 


Glu 


He 


Lys 


Asp Gly Lys 


Ala 



191 
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Glu 


Val 


Trp 


lie 


Asp 


Thr 


Ser 


Pro 








100 


lie 


Glu 


Ala 


Phe 






115 




Phe 


Lys 


Val 


Thr 




130 






Lys 


Ala 


Asp 


Pro 


145 








Leu 


Ser 


Glu 


Ser 


lie 


He 


Glu 


Gly 








180 


Asp 


Asp 


Tyr 


Tyr 






195 




Lys 


Thr 


He 


Phe 




210 






Leu 


Leu 


Phe 


Lys 


225 








Met 


Glu 


Tyr 


Lys 


Leu 


Asp 


Gly 


Val 








260 


Arg 


Thr 


Thr 


Val 






275 




Glu 


Ser 


Ala 


val 




290 






Asn 


Asp 


Arg 


Gly 


305 








Glu 


He 


His 


He 


Asn 


Lys 


Gly 


Leu 








34 0 


Gly 


Gly 


Val 


Thr 






355 




His 


Val 


His 


He 




370 






Asp 


Lys 


Asp 


Phe 


385 








Phe 


Met 


Val 


Pro 


Thr 


Arg 


He 


Arg 








420 


Gly 


lie 


Gly 


Val 






435 




Gly 


Glu 


Leu 


Ser 




450 






lie 


Asp 


Lys 


Thr 


465 








lie 


Ala 


Ser 


Glu 


Thr 


Tyr 


Trp 


Val 





70 






Leu 


Gin 


Gly 


Va 1 


85 








Arg 


He 


Pne 


Phe 


Leu 


Thr 


Asn 


Pro 








120 


tt_ T 

Val 


Asp 


Gly 


Lys 






135 




Thr 


Asp 


He 


Asp 
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<210> 25 
<211> 2478 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 25 

atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 60 
ggccactggt acaagcacca gcgcgcctac cagttcaccg gegaggaega ettegggaag 120 

193 



WO 21)05/096804 



PCT/US2004/007182 



gtggccgtgg tgaagctccc gatggacctc accaaggtgg gcatcatcgt gcgcctcaac 180 
gagtggcagg cgaaggacgt ggccaaggac cgcttcatcg agatcaagga cggcaaggcc 240 
gaggtgtgga tactccaggg cgtggaggag atcttctacg agaagccgga cacctccccg 300 
cgcatcttct tcgcccaggc ccgctccaac aaggtgatcg aggccttcct caccaacccg 360 
gtggacacca agaagaagga gctgttcaag gtgaccgtcg acggcaagga gatcccggtg 420 
tcccgcgtgg agaaggccga cccgaccgac atcgacgtga ccaactacgt gcgcatcgtg 480 
ctctccgagt ccctcaagga ggaggacctc cgcaaggacg tggagctgat catcgagggc 540 
tacaagccgg cccgcgtgat catgatggag atcctcgacg actactacta cgacggcgag 600 
ctgggggcgg tgtactcccc ggagaagacc atcttccgcg tgtggtcccc ggtgtccaag 660 
tgggtgaagg tgctcctctt caagaacggc gaggacaccg agccgtacca ggtggtgaac 720 
atggagtaca agggcaacgg cgtgtgggag gccgtggtgg agggcgacct cgacggcgtg 780 
ttctacctct accagctgga gaactacggc aagatccgca ccaccgtgga cccgtactcc 840 
aaggccgcgt acgccaacaa ccaggagtct gcagtggtga acctcgcccg caccaacccg 900 
gagggctggg agaacgaccg cggcccgaag atcgagggct acgaggacgc catcatctac 960 
gagatccaca tcgccgacat caccggcctg gagaactccg gcgtgaagaa caagggcctc 1020 
tacctcggcc tcaccgagga gaacaccaag gccccgggcg gcgtgaccac cggcctctcc 1080 
cacctcgtgg agctgggcgt gacccacgtg cacatcctcc cgttcttcga cttctacacc 1140 
ggcgacgagc tggacaagga cttcgagaag tactacaact ggggctacga cccgtacctc 1200 
ttcatggtgc cggagggccg ctactccacc gacccgaaga acccgcacac ccgaattcgc 1260 
gaggtgaagg agatggtgaa ggccctccac aagcacggca tcggcgtgat cacggacatg 1320 
gtgttcccgc acacctacgg catcggcgag ctgtccgcct tcgaccagac cgtgccgtac 1380 
tacttctacc gcatcgacaa gaccggcgcc tacctcaacg agtccggctg cggcaacgtg 1440 
atcgcctccg agcgcccgat gatgcgcaag tccatcgtgg acaccgtgac ctactgggtg 1500 
aaggagtacc acatcgacgg cttccgcttc gaccagatgg gcctcatcga caagaagacc 1560 
atgctggagg tggagcgcgc cctccacaag atcgacccga ccatcatcct ctacggcgag 1620 
ccgtggggcg gctggggggc cccgatccgc ttcggcaagt ccgacgtggc cggcacccac 1680 
gtggccgcct tcaacgacga gttccgcgac gccatccgcg gctccgtgtt caacccgtcc 1740 
gtgaagggct tcgtgatggg cggctacggc aaggagacca agatcaagcg cggcgtggtg 1800 
ggctccatca actacgacgg caagctcatc aagtccttcg ccctcgaccc ggaggagacc 1860 
atcaactacg ccgcctgcca cgacaaccac accctctggg acaagaacta cctcgccgcc 1920 
aaggccgaca agaagaagga gtggaccgag gaggagctga agaacgccca gaagctcgcc 1980 
ggcgccatcc tcctcactag tcagggcgtg ccgttcctcc acggcggcca ggacttctgc 2040 
cgcaccacca acttcaacga caactcctac aacgccccga tctccatcaa cggcttcgac 2100 
tacgagcgca agctccagtt catcgacgtg ttcaactacc acaagggcct catcaagctc 2160 
cgcaaggagc acccggcctt ccgcctcaag aacgccgagg agatcaagaa gcacctggag 2220 
ttcctcccgg gcgggcgccg catcgtggcc ttcatgctca aggaccacgc cggcggcgac 2280 
ccgtggaagg acatcgtggt gatctacaac ggcaacctgg agaagaccac ctacaagctc 2340 
ccggagggca agtggaacgt ggtggtgaac tcccagaagg ccggcaccga ggtgatcgag 2400 
accgtggagg gcaccatcga gctggacccg ctctccgcct acgtgctcta ccgcgagtcc 2460 
gagaaggacg agctgtga 2478 

<210> 26 
<211> 716 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 26 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser Met Glu Thr He Lys He Tyr Glu Asn Lys Gly Val Tyr 

20 25 30 
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Phe 


Pro 


Phe 


Tvr 




4 90 










4 95 




lie 

X * w 


Asd 


Thr 


Glu 


Pro 


Val 


Phe 


Leu 


505 

J V J 










510 

W X w 








Glu 


He 

X X v— ■ 


Val 

V IX X 


Glu 


Leu 

*J w vx 


Ara 


Tvr 

x y x 










525 

-J -J 








Leu 


Ala 


Leu 


Glu 


Ala 


Ser 


Glu 


Lvs 








54 0 

J "T V 










Phe 


Tvr 


Glu 


Phe 


Gin 


Asd 


Asp 


Asp 






555 
3 3 3 










560 

■J \J \J 


Tvr 
i yi 


Met 


Val 

V t4 X 


Glv 

vjxy 


Lvs 


Tvr 

* y x 


Leu 


Leu 




5 7 0 

3 ' V 










57 5 

J f mi 




m u 

u i U 


Car 

JC J- 


A TCI 


{.on 


Val 

V U X 


Thr 

X 1 1 X 


Leu 


Pro 


5 R5 










590 
^ -j \j 






Sen 


Gl v 


Glu 

VJ AU 


He 

x x w 


He 

x x w 


Asn 


Glv 


Lvs 










605 

w V w 








Leu 

■kJ w L* 


Pro 


He 


Tvr 


Leu 


Arq 


Glu 


Gly 

w * 7 








62 0 

\J C* V 










Glu 

VJ X ix 


Leu 

■LJ 14 


He 

X X *w 


Val 

" V* X 


Tvr 


Glv 


Glu 


Thr 






615 

U 3 3 










640 

w T w 


Glu 

VJ X u 


He 

X X w 


Thr 


Ser 

W w X 


Ser 


Ser 


Asn 


Glu 




650 










655 




Val 


Ser 


Lys 


Leu 


Thr 


He 


Thr 


Ser 


665 










670 






Val 


Asp 


Asp 


Ser 


Lys 


Glu 


lie 


Gin 










685 








Tyr 


Val 


Ala 


Lys 


lie 


Asn 


Gin 


Lys 








700 










Ser 


Glu 


Lys 


Asp 


Glu 


Leu 







<210> 27 

<211> 712 

<212> PRT 

<2 1 3 > Artificial Sequence 
<220> 

< 2 2 3 > synthetic 



<400> 27 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 






5 










10 










15 




Ala 


Thr 


Ser 


Met 


Glu 


Thr 


He 


Lys 


lie 


Tyr 


Glu 


Asn 


Lys 


Gly 


Val 


Tyr 








20 










25 










30 






Lys 


Val 


Val 


lie 


Gly 


Glu 


Pro 


Phe 


Pro 


Pro 


lie 


Glu 


Phe 


Pro 


Leu 


Glu 




35 










40 










45 








Gin 


Lys 


lie 


Ser 


Ser 


Asn 


Lys 


Ser 


Leu 


Ser 


Glu 


Leu 


Gly 


Leu 


Thr 


lie 




50 










55 










60 










Val 


Gin 


Gin 


Gly 


Asn 


Lys 


Val 


lie 


Val 


Glu 


Lys 


Ser 


Leu 


Asp 


Leu 


Lys 


65 










70 










75 










80 


Glu 


His 


lie 


lie 


Gly 


Leu 


Gly Glu 


Lys 


Ala 


Phe 


Glu 


Leu 


Asp 


Arg 


Lys 



196 
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Arg 


Lys 


Arg 


Tyr 








100 


Tyr 


Gin 


Asp 


Pro 






115 




Asp Gly 


Val 


Ala 




130 






Phe 


Asp 


Val 


Glv 


145 








Glu 


Asp 


Ser 


Val 


Val 


Leu 


Glu 


Lys 








180 


Met 


Trp 


Ala 


Phe 






195 




Asp 


Lys 


Val 


Val 




210 






Val 


Ala 


Gl v 


Va 1 

vox 


225 








Phe 


Thr 


TrD 


His 


Glu 


Leu 


His 


Lys 








260 


lie 


Arg 


Val 


Asp 






275 




Phe 


Cys 


Glu 


lie 




290 






Gly Thr 


Thr 


Val 


305 








Trp 


Ala 


Glv 


Leu 


Trp 


Leu 


Asp 


Met 








340 


Arg 


Asp 


Val 


Leu 






355 




Val 


Thr 


Thr 


Phe 




370 






Val 


Lys 


His 


Glu 


385 








Ala 


Thr 


Phe 


Lvs 


Leu 


Ser 


Arg 


Ala 








420 


Thr Gly 


Asp 


Asn 






435 




Leu 


Val 


Leu 


Gly 




450 






lie 


Gly 


Gly 


Phe 


465 








Asp 


Leu 


Leu 


Val 


Arg 


Ser 


His 


Lys 








500 


Pro 


Asp 


Tyr 


Tyr 



85 

Val Met Tyr Asn 

Leu Tyr Val Ser 

120 

Thr Gly Tyr Phe 
135 

Leu Glu Glu Tyr 
150 

Glu Phe Tyr Val 
165 

Tyr Thr Glu Leu 

Gly Tyr Met lie 

200 

Glu Leu Val Asp 
215 

Phe Leu Asp lie 
230 

Pro Tyr Arg Phe 
245 

Arg Asn Val Lys 

Gin Asn Tyr Ser 

280 

Glu Ser Gly Glu 
295 

Tyr Pro Asp Phe 
310 

He Ser Glu Trp 
325 

Asn Glu Pro Thr 

Ser Ser Leu Pro 

360 

Pro Asp Asn Val 
375 

Lys Val Arg Asn 
390 

Gly Phe Arg Thr 
405 

Gly Tyr Ala Gly 

Thr Pro Ser Trp 

440 

Leu Ser He Ser 
455 

Gin Gly Arg Asn 
470 

Lys Tyr Tyr Ala 
485 

Ala Thr Asp Gly 
Lys Glu Lys Val 





90 






Val 


Asp 


Ala 


Gly 


105 








He 


Pro 


Leu 


Phe 


Phe 


Asn 


Ser 


Ala 








140 


Asp 


Lys 


val 


lie 






155 




He 


Glu 


Gly 


Pro 




170 






Thr Gly 


Lys 


Pro 


185 








Ser 


Arg 


Tyr 


Ser 


He 


Met 


Gin 


Lys 








220 


His 


Tvr 


Met 


Asp 






235 




Pro 


Glu 


Pro 


Lys 




250 






Leu 


He 


Thr 


He 


265 








Pro 


Phe 


Leu 


Ser 


Leu 


Phe 


Val 


Gly 








300 


Phe 


Arg 


Glu 


Asp 






315 




Leu 


Ser 


Gin 


Gly 




330 






Asp 


Phe 


Ser 


Arg 


345 








Val 


Gin 


Phe 


Arg 


Val 


His 


Tyr 


Leu 








380 


Ala 


Tyr 


Pro 


Leu 






395 




Ser 


His 


Arg 


Asn 




410 






He 


Gin 


Arg 


Tyr 


425 








Asp 


Asp 


Leu 


Lys 

4 


Gly Val 


Pro 


Phe 








460 


Phe 


Ala 


Glu 


He 






475 




Leu 


Ala 


Leu 


Phe 




4 90 






lie 


Asp 


Thr 


Glu 


505 








Lys 


Glu 


He 


Val 







95 




Ala 


Tyr. 


Lys 


Lys 




110 






He 


Ser 


Val 


Lys 

* 


125 








Ser 


Lys 


Val 


lie 


Val 


Thr 


He 


Pro 








160 


Arg 


He 


Glu 


Asp 






175 




Phe 


Leu 


Pro 


Pro 




190 






Tyr 


Tyr 


Pro 


Gin 


205 








Glu 


Gly 


Phe 


Arg 

9MMW 


Ser 


Tyr 


Lys 


Leu 








240 


Lys 


Leu 


He 


Asp 

* 






255 




Val 


Asp 


His 


Gly 




270 






Gly 

4 


Met 


Gly 


Lys 


285 








Lys 


Met 


Trp 


Pro 


Thr 


Arg 


Glu 


Trp 








320 


Val 


Asp 

IT 


Gly 


He 






335 




Ala 


lie 


Glu 


He 




350 






Asp 


Asp 


Arg 


Leu 


365 








Arg 


Gly 


Lys 

* 


Arg 


Tyr 


Glu 


Ala 


Met 








400 


Glu 


He 


Phe 


He 






415 




Ala 


Phe 


He 


Trp 




430 






Leu 


Gin 


Leu 


Gin 


445 








Val 


Gly 


Cys 

43 


Asp 


Asp 


Asn 


Ser 


Met 








480 


Phe 


Pro 


Phe 


Tyr 






495 




Pro 


Val 


Phe 


Leu 




510 






Glu 


Leu 


Arg 


Tyr 
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515 










520 










525 


*• 






Lys 


Phe 


Leu 


Pro Tyr 


He 


Tyr 


Ser 


Leu 


Ala 


Leu 


Glu 


Ala 


Ser 


Glu 


Lys 




530 










535 










540 








Gly His 


Pro 


Val 


He 


Arg 


Pro 


Leu 


Phe 


Tyr 


Glu 


Phe 


Gin 


Asp Asp Asp 


545 










550 










555 










560 


Asp 


Met 


Tyr 


Arg 


He 
565 


Glu 


Asp 


Glu 


Tyr 


Met 

570 


Val 


Gly 


Lys 


Tyr 


Leu 
575 


Leu 


Tyr 


Ala 


Pro 


He 
580 


Val 


Ser 


Lys 


Glu 


Glu 
585 


Ser 


Arg 


Leu 


Val 


Thr 
590 


Leu 


Pro 


Arg Gly 


Lys 


Trp 


Tyr 


Asn 


Tyr 


Trp Asn Gly Glu 


He 


He 


Asn 


Gly 


Lys 






595 










600 










605 






Ser 


Va 1 


Val 


Lys 


Ser 


Thr 


His 


Glu 


Leu 


Pro 


He 


Tyr 


Leu 


Arg 


Glu Gly 




610 










615 










620 










Ser 


He 


Tie 


Pro 


Leu 


Glu 


Gly 


Asp 


Glu 


Leu 


He 


Val 


Tyr Gly 


Glu 


Thr 


625 










630 










635 










640 


Ser 


Phe 


Lys 


Arg 


Tyr 
645 


Asp 


Asn 


Ala 


Glu 


He 
650 


Thr 


Ser 


Ser 


Ser 


Asn 
655 


Glu 


lie 


Lys 


Phe 


Ser 
660 


Arg 


Glu 


He 


Tyr 


Val 
665 


Ser 


Lys 


Leu 


Thr 


lie 
670 


Thr 


Ser 


Glu 


Lys 


Pro 
675 


Val 


Ser 


Lys 


He 


He 
680 


Val 


Asp 


Asp 


Ser 


Lys 
685 


Glu 


lie 


Gin 


Val 


Glu 


Lys 


Thr 


Met 


Gin 


Asn 


Thr 


Tyr 


Val 


Ala 


Lys 


He 


Asn 


Gin 


Lys 




690 










695 










700 








lie 


Arg 


Gly 


Lys 


He 


Asn 


Leu 


Glu 


















705 










710 























<210> 28 

<2U> 469 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> synthetic 



<400> 28 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala Leu 


Ala Ala Ser 


1 








5 










10 






15 


Ala 


Thr 


Ser 


Met 


Ala 


Glu 


Phe 


Phe 


Pro 


Glu 


lie 


Pro Lys 


lie Gin Phe 








20 










25 








30 


Glu 


Gly 


Lys 


Glu 


Ser 


Thr 


Asn 


Pro 


Leu 


Ala 


Phe 


Arg Phe 


Tyr Asp Pro 






35 










40 








45 




Asn 


Glu 


Val 


He 


Asp 


Gly 


Lys 


Pro 


Leu 


Lys 


Asp 


His Leu 


Lys Phe Ser 




50 










55 










60 




Val 


Ala 


Phe 


Trp 


His 


Thr 


Phe 


Val 


Asn Glu Gly Arg Asp Pro Phe Gly 


65 










70 










75 




80 


Asp 


Pro 


Thr 


Ala 


Glu 


Arg 


Pro 


Trp 


Asn 


Arg 


Phe 


Ser Asp 


Pro Met Asp 










85 










90 






95 


Lys 


Ala 


Phe 


Ala 


Arg 


Val 


Asp 


Ala 


Leu 


Phe 


Glu 


Phe Cys 


Glu Lys Leu 








100 










105 








110 


Asn 


lie 


Glu 


Tyr 


Phe 


Cys 


Phe 


His 


Asp 


Arg 


Asp 


He Ala 


Pro Glu Gly 






115 










120 








125 




Lys 


Thr 


Leu 


Arg 


Glu 


Thr 


Asn 


Lys 


He 


Leu 


Asp 


Lys Val 


Val Glu Arg 




130 










135 










140 



198 
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lie 


Lys 


Glu 


Arg 


Met 


Lys 


Asp 


Ser 


Asn 


Val 


Lys 


Leu 


Leu 


Trp 


Gly Thr 


145 










150 










155 










160 


Ala 


Asn 


Leu 

±J w W4 


Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 


His 


Gly Ala 


Ala 


Thr 


Thr 










165 

Jm Vi* 










170 










175 




Cys 


Ser 


Ala 


ASD 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 








X O \J 










185 










190 






Leu 


Glu 


lie 

A i- V 


Thr 


Lvs 


Glu 


Leu Gly Gly Glu Gly Tyr 


Val 


Phe 


Trp Gly 
















200 










205 








Gly Arg 


Glu 


Glv 


Tvr 


Glu 


Thr 


Leu 


Leu 


Jon 


1 ILL 


Ren 
no £J 


Leu Gly 


Leu 


Glu 

\J X u 




210 










215 










220 










Leu 


Glu 


A Q n 
noli 


T.ei 1 


Ala 


Ara 


Phe 


Leu 


Arg 


Met 


Ala 


Val 


Glu 


Tyr 


Al a 

r\ x a 


Lv*; 
xj y 0 


225 










C J V 










235 










24 0 


Lys 


lie 


Gl v 

\j x y 


Phe 

rue 


Thr 


Glv 


Gin 


Phe 


Leu 


lie 


Glu 


Pro 


Lys 


Pro 


jj y 0 


Gl 11 
uxu 








2 4 ^ 
*. *i j 










250 










2 55 




Pro 


Thr 


uy a 


H i q 
n j 0 


Gin 


Tvr 

y 


Asp 


Phe 


Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Al a 

n x 0 


Phe 








9 0 










265 










270 






Leu 


Lys 


Acn 
/\ ft J * 


n x s 


G 1 v 
u x y 


T iPi 1 


Asp 


Glu 


Tyr 


Phe 


Lys 


Phe 


Asn 


lie 


Gl 11 


Al a 
















280 










285 








Asn 


His 


Ala 


Th r 

1 I 1 X 


T.pn 


Ala 


Gly 


His 


Thr 


Phe 


Gin 


His 


Glu 


Leu 


A TCI 
r\x y 


Met 




290 










295 










300 










Ala Arg 


Tie 

IXC 


T .P 11 


uxy 




Leu 


Gly 


Ser 


lie 


Asp 


Ala 


Asn 


Gin 


Glv 


Icn 


305 










J J. V 










315 










320 


Leu 


Leu 


i—i c u 


Glv 

\j x y 


Tyn 
1 J.p 




Thr Asp Gin 


Phe 


Pro 


Thr 


Asn 


He 


iy l 


Aso 










J 6 J 










330 










335 




Thr 


Thr 


T .0 1 1 
ucu 


Ala 


Met" 




Glu 


Val 


He 


Lys Ala Gly Gly Phe 


Thr 


Lys 








J *l V 










345 










350 






Gly Gly 


T .£1 1 1 

uc u 


&qn 
D 1 1 


Phe 


Q en 


Ala 


Lys 


Val 


Arg 


Arg 


Ala 


Ser 


Tyr 


Lys 


Val 






ICC 










360 










365 








Glu 


Asp 


Leu 

LJ w 


Phe 


I le 

X X *w 


Glv 


His 


He 


Ala 


Gly 


Met 


Asp 


Thr 


Phe 


Ala 


Leu 




370 










375 










380 










Gly 


Phe 


uy 0 


lie 

x x e 


Ala 


Tvr 

y 


Lys 


Leu Ala Lys Asp Gly Val 


Phe 


Asp 


Lys 


385 










3 90 










395 










400 


Phe 


lie 


Glu 


Glu 


Lvs 


Tvr 


Arg 


Ser 


Phe 


Lys 


Glu Gly 


He 


Gly 


Lys 


Glu 










405 










410 










415 




lie 


Val 


Glu 


Gly 


Lys 


Thr 


Asp 


Phe 


Glu 


Lys 


Leu 


Glu 


Glu Tyr 


He 


lie 








420 










425 










430 






Asp 


Lys 


Glu 


Asp 


lie 


Glu 


Leu 


Pro 


Ser Gly Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 






435 










440 










445 








Ser 


Leu 


Leu 


Asn 


Ser 


Tyr 


He 


Val 


Lys 


Thr 


He 


Ala 


Glu 


Leu 


Arg 


Ser 




450 










455 










460 










Glu 


Lys 


Asp 


Glu 


Leu 

























465 



<210> 29 
<2U> 469 
<212> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > synthetic 
<400> 29 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

199 



\ 
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15 10 15 

Ala Thr Ser Met Ala Glu Phe Phe Pro Glu He Pro Lys Val Gin Phe 

20 25 30 

Glu Gly Lys Glu Ser Thr Asn Pro Leu Ala Phe Lys Phe Tyr Asp Pro 

35 40 45 

Glu Glu He He Asp Gly Lys Pro Leu Lys Asp His Leu Lys Phe Ser 

50 55 60 

Val Ala Phe Trp His Thr Phe Val Asn Glu Gly Arg Asp Pro Phe Gly 
65 70 75 80 

Asp Pro Thr Ala Asp Arg Pro Trp Asn Arg Tyr Thr Asp Pro Met Asp 

85 90 95 

Lys Ala Phe Ala Arg Val Asp Ala Leu Phe Glu Phe Cys Glu Lys Leu 

100 105 110 

Asn He Glu Tyr Phe Cys Phe His Asp Arg Asp He Ala Pro Glu Gly 

115 120 125 

Lys Thr Leu Arg Glu Thr Asn Lys He Leu Asp Lys Val Val Glu Arg 

130 135 140 

He Lys Glu Arg Met Lys Asp Ser Asn Val Lys Leu Leu Trp Gly Thr 
145 150 155 160 

Ala Asn Leu Phe Ser His Pro Arg Tyr Met His Gly Ala Ala Thr Thr 

165 170 175 

Cys Ser Ala Asp Val Phe Ala Tyr Ala Ala Ala Gin Val Lys Lys Ala 

180 185 190 

Leu Glu He Thr Lys Glu Leu Gly Gly Glu Gly Tyr Val Phe Trp Gly 

195 200 205 

Gly Arg Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Gly Phe Glu 

210 215 220 

Leu Glu Asn Leu Ala Arg Phe Leu Arg Met Ala Val Asp Tyr Ala Lys 
225 230 235 240 

Arg He Gly Phe Thr Gly Gin Phe Leu He Glu Pro Lys Pro Lys Glu 

245 250 255 

Pro Thr Lys His Gin Tyr Asp Phe Asp Val Ala Thr Ala Tyr Ala Phe 

260 265 270 

Leu Lys Ser His Gly Leu Asp Glu Tyr Phe Lys Phe Asn He Glu Ala 

275 280 285 

Asn His Ala Thr Leu Ala Gly His Thr Phe Gin His Glu Leu Arg Met 

290 295 300 

Ala Arg He Leu Gly Lys Leu Gly Ser He Asp Ala Asn Gin Gly Asp 
305 310 315 320 

Leu Leu Leu Gly Trp Asp Thr Asp Gin Phe Pro Thr Asn Val Tyr Asp 

325 330 335 

Thr Thr Leu Ala Met Tyr Glu Val He Lys Ala Gly Gly Phe Thr Lys 

340 345 350 

Gly Gly Leu Asn Phe Asp Ala Lys Val Arg Arg Ala Ser Tyr Lys Val 

355 360 365 

Glu Asp Leu Phe He Gly His lie Ala Gly Met Asp Thr Phe Ala Leu 

370 375 380 

Gly Phe Lys Val Ala Tyr Lys Leu Val Lys Asp Gly Val Leu Asp Lys 
385 390 395 400 

Phe He Glu Glu Lys Tyr Arg Ser Phe Arg Glu Gly He Gly Arg Asp 

405 410 415 

lie Val Glu Gly Lys Val Asp Phe Glu Lys Leu Glu Glu Tyr He lie 

420 425 430 

Asp Lys Glu Thr lie Glu Leu Pro Ser Gly Lys Gin Glu Tyr Leu Glu 

200 
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435 440 445 

Ser Leu He Asn Ser Tyr He Val Lys Thr He Leu Glu Leu Arg Ser 

450 455 460 

Glu Lys Asp Glu Leu 
465 



<210> 30 

<2ll> 463 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 30 



Met 


Arg 


Val 


Leu 


Leu 


Va 1 


Ala 


Leu 


1 








5 








TV 1 — 

Ala 


Thr 


Ser 


Met 


Ala 


GlU 


Pne 


pne 








20 










Glu 


Gly 


Lys 


Glu 


Ser 


Thr 


TV a hrft 

Asn 


Pro 






35 










4 0 


Glu 


Glu 


He 


He 


Asp 


Gly 


Lys 


^5 "V* 

pro 




50 










55 




Val 


Ala 


Pne 


Trp 


His 


I nr 


n W> 

pne 


va i 


65 










70 






Asp 


Pro 


Thr 


Ala 


Asp 


Arg 


T"\ -wb <n. 

pro 


Trp 










85 








V _ _ 

Lys 


Ala 


Phe 


Ala 


Arg 


va l 


Asp 


Ala 








100 










Asn 


He 


Glu 


Tyr 


Phe 


Cys 


Phe 


His 






115 










120 


Lys 


Thr 


Leu 


Arg 


Glu 


Thr 


Asn 


Lys 




130 










135 




lie 


Lys 


Glu 


Arg 


Met 


Lys 


Asp 


Ser 


145 










150 






Ala 


Asn 


Leu 


Phe 


Ser 


His 


Pro 


Arg 










165 








Cys 


Ser 


Ala 


Asp 


Val 


Phe 


Ala 


Tyr 








180 










Leu 


Glu 


He 


Thr 


Lys 


Glu 


Leu 


Gly 






195 










200 


Gly 


Arg 


Glu 


Gly Tyr 


Glu 


Thr 


Leu 




210 










215 




Leu 


Glu 


Asn 


Leu 


Ala 


Arg 


Phe 


Leu 


225 










230 






Arg 


He 


Gly 


Phe 


Thr 


Gly 


Gin 


Phe 










245 








Pro 


Thr 


Lys 


His 


Gin 


Tyr 


Asp 


Phe 








260 










Leu 


Lys 


Ser 


His 


Gly 


Leu 


Asp 


Glu 






275 










280 


Asn 


His 


Ala 


Thr 


Leu 


Ala 


Gly 


His 




290 










295 





Ala 




beu 


Ala 


Leu 


Ala 


Ala 


Ser 




1 0 










15 




r> v<» a. 

pro 


uIU 


i ie 


Pro 


Lys 


val 


Gin 


Phe 


2 5 










30 






Leu 


Aid 


trie 


Lys 


Phe 


Tyr 


Asp 


Pro 










45 








T AM 


bys 


ASp 


His 


Leu 


Lys 


Phe 


Ser 








60 










T\. <i v\ 

Asn 


pin 


uiy 


Arg 


Asp 


Pro 


Phe Gly 






/ D 










80 


T\ 

Hsn 


Mxy 


Tyr 


Thr 


Asp 


Pro 


Met 


Asp 




Q A 










95 




ucli 


pne 


o 1U 


Phe 


Cys 


Glu 


Lys 


Leu 


105 










110 






Asp 


Arg 


Asp 


He 


Ala 


Pro 


Glu 


Gly 










125 








He 


Leu 


Asp 


Lys 


Val 


Val 


Glu 


Arg 








140 










Asn 


Val 


Lys 


Leu 


Leu 


Trp Gly Thr 






155 










160 


Tyr 


Met 


His 


Gly Ala 


Ala 


Thr 


Thr 




170 










175 




Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 


185 










190 






Gly 


Glu 


Gly 


Tyr 


Val 


Phe 


Trp Gly 










205 








Leu 


Asn 


Thr 


Asp 


Leu 


Gly 


Phe 


Glu 








220 










Arg 


Met 


Ala 


Val 


Asp 


Tyr 


Ala 


Lys 






235 










240 


Leu 


He 


Glu 


Pro 


Lys 


Pro 


Lys 


Glu 




250 










255 




Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 


265 










270 






Tyr 


Phe 


Lys 


Phe 


Asn 


lie 


Glu 


Ala 










285 








Thr 


Phe 


Gin 


His 


Glu 


Leu 


Arg 


Met 



300 

201 
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Ala 


Arg 


lie 


Leu Gly Lys 


Leu 


Gly 


305 










310 






Leu 


Leu 


Leu Gly Trp Asp 


1114. 


Ian 










325 








Thr 


Thr 


T on 
Lit; U 


nld 

340 




ryr 


Glu 


Val 


Gly Gly 


L»eu 




rnc 


HSp 


Ala 


Lys 






355 










1 & A 


Glu 


Asp 
370 


Leu 


Phe 


He 


Gly 


His 

1 f D 


lie 


Gly 


Phe 


Lys 


Val 


Ala 


Tyr 


Lys 


Leu 


385 










390 






Phe 


lie 


Glu 


Glu 


Lys 
405 


Tyr 


Arg 


Ser 


lie 


Val 


Glu 


Gly 

420 


Lys 


Val 


Asp 


Phe 


Asp 


Lys 


Glu 
435 


Thr 


lie 


Glu 


Leu 


Pro 

440 


Ser 


Leu 
450 


He 


Asn 


Ser 


Tyr 


lie 

455 


Val 



Ser 


He 


Asp Ala Asn Gin Gly Asp 






315 










320 


Gin 


Phe 

J -3 v 


Pro 


Thr 


Asn 


Val 


Tvr 
335 


ficn 
nop 


He 


Lys 


Ala Gly Gly Phe 


Th r 


T.\/Q 

uy a 












3 50 

J J w 






V CI X 




Arg 


Ala 


Ser 
365 


Tyr 


U Y O 


Val 

V CI X 


Ala 


Gly 


Met 


Asp 
380 


Thr 


Phe 


Ala 


J-J C u 


Va 1 




Asp 
395 


Gly 


Val 


Leu 


Asp 


Lys 
400 


Phe 


Arg 
410 


Glu 


Gly 


He 


Gly 


Arg 
415 


Asp 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


425 










430 






Ser 


Gly 


Lys 


Gin 


Glu 
445 


Tyr 


Leu 


Glu 


Lys 


Thr 


He 


Leu 


Glu 


Leu 


Arg 





460 



<210> 31 

<211> 25 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<400> 31 

Met Gly Lys Asn Gly Asn Leu Cys 
1 5 

Leu Ala Gly Leu Ala Ser Gly His 

20 



Cys Phe Ser Leu Leu Leu Leu Leu 
10 15 

Gin 

25 



<210> 32 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 32 

Met Gly Phe Val Leu Phe Ser Gin 

1 5 
Thr Leu Leu Leu Phe Leu Val lie 

20 



Leu Pro Ser Phe Leu Leu Val Ser 

10 15 
Ser His Ser Cys Arg Ala 
25 30 



<210> 33 
<2H> 460 



202 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3 > synthetic 



<400> 33 



Met" 




Va 1 

vox 


w w U 


hLJ w LI 


Val 

V d ^ 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 






5 










10 










15 




Ala 


1 1 1 L 


Car 

*9 C X 


Al a 


Lv s 

XJ V W 


Tvr 
i y x 


Leu 

U V- V* 


Glu 

Vrf* A 


Leu 


Glu 


Glu 


Gly Gly Val 


He 


Met 








20 










25 










30 






nln 


Ala 


Phe 


Tvr 


Trn 


Asd 


Val 

V W A 


Pro 


Ser 


Gly 


Gly 


He 


Trp 


Trp 


Asd 


Thr 






35 










40 










45 








Tie 






Lv <^ 


lie 


Pro 


Glu 


Tro 


Tvr 


Asp 


Ala 


Gly 


He 


Ser 


Ala 


He 




50 










55 










60 










1 I. {J 


T 1 e 

X J. c 




Prn 


Ala 


Ser 

w C X 


Lvs 

w y w 


Glv 


Met 


Ser Gly Gly Tyr 


Ser 


Met 


Gly 


65 










70 










75 










80 


i y i 


A c 


r i u 


y 


ficn 


Tvr 
i y x 


Phe 

Lit W 


Asd 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys 


Gly 










85 










90 










95 




Thr 


v a l 


v 1 U 


Th r 

1 II L 


Ara 


Phe 

ill 


Glv 


Ser 


Lys 


Gin 


Glu 


Leu 


lie 


Asn 


Met 


lie 








100 

A W V/ 










105 










110 






no n 


Th r 


A 1 A 


His 


Ala 


Tvr 
i y x 


Glv 


lie 


Lys 


Val 


He 


Ala 


Asp 


lie 


val 


lie 






115 










120 










125 








Jen 
Moll 




" *■ y 


Ala 

Ik X d 


Glv 


Glv 


ASD 


Leu 

*mi W W* 


Glu 


Trp 


Asn 


Pro 


Phe 


Val 


Gly 


Asp 




130 

.i V w 










135 










140 










i yr 


Th r 


Trn 


Thr 
i in 


Asn 


Phe 


Ser 


Lvs 


Val 


Ala 


Ser Gly 


Lys 


Tyr 


Thr 


Ala 


14 5 










150 










155 










160 


Zi c n 
Moll 


iyr 


uc u 




Phe 

r lit 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly Asp 


Ser Gly 










165 










170 










175 




Thr 


Phe 


Gl V 


Glv 

vjx y 


Tvr 


Pro 


Asd 


He 


Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 








180 










185 










190 






Tyr 


Trp 


Leu 

i-t W U 


Trn 
1 L F 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 






195 










200 










205 








lie 


Gly 


lie 


Asn 


Ala 


TrD 


Ara 


Phe 


Asp Tyr 


Val 


Lys 


Gly Tyr 


Gly Ala 




210 










215 










220 










Trp 


val 


Val 


Lvs 


Asd 


TrD 


Leu 


Asn 


Trp Trp Gly Gly Trp Ala 


Val 


Gly 


225 










230 










235 










240 


Glu 


Tyr 




Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 










245 










250 










255 




Ser 


Gly 


Ala 


Lys 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 








260 










265 










270 






Ala 


Phe 


Asp 


Asn 


Lys 


Asn 


lie 


Pro 


Ala 


Leu 


val 


Glu 


Ala 


Leu 


Lys 


Asn 






275 










280 










285 








Gly Gly 


Thr 


val 


Val 


Ser 


Arg 


Asp 


Pro 


Phe 


Lys 


Ala 


Val 


Thr 


Phe 


Val 




290 










295 










300 










Ala 


Asn 


His 


Asp 


Thr 


Asp 


lie 


lie 


Trp 


Asn 


Lys 


Tyr 


Pro 


Ala 


Tyr 


Ala 


305 










310 










315 










320 


Phe 


lie 


Leu 


Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


He 


Phe 


Tyr 


Arg 


Asp 


Tyr 










325 










330 










335 




Glu 


Glu 


Trp 


Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 


Asn 


Leu 


lie 


Trp 


lie 


His 








340 










345 










350 






Asp 


Asn 


Leu 


Ala Gly 


Gly 


Ser 


Thr 


Ser 


He 


Val 


Tyr 


Tyr 


Asp 


Ser 


Asp 




355 










360 










365 








Glu 


Met 


lie 


Phe 


Val 


Arg 


Asn 


Gly Tyr Gly Ser 


Lys 


Pro Gly 


Leu 


lie 
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370 






17 5 
j * v 








380 

Www 










Thr 


Tyr 


He 


Asn Leu Gly 


Ser 


Ser 


Lys 


Val 


Gly Arg 


Trp 


Val 


Tyr 


Val 


385 






390 










J ^ V 








400 


Pro 


Lys 


Phe 


Ala Gly Ala 


Cys 


He 


His 


Glu 


Tyr Thr 


Gly Asn 


Leu 


Gly 








405 








410 








415 




Gly Trp Val Asp Lys Tyr 


Val 


Tyr 


Ser 


Ser 


Gly Trp 


Val 


Tyr 


Leu 


Glu 








420 






425 








430 






Ala 


Pro 


Ala 

435 


Tyr Asp Pro 


Ala 


Asn 
440 


Gly 


Gin 


Tyr Gly 


Tyr 
445 


Ser 


Val 


Trp 


Ser 


Tyr 
450 


Cys 


Gly Val Gly 


Ser 
455 


Glu 


Lys 


Asp 


Glu Leu 
460 











<210> 34 

<211> 825 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 34 



Mpt- 




Val 




Ijf»U 

X-J V» V4 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 








c 










i n 










15 




Ala 


Thr 


Ser 


Ala 


Gly 


His 


Trp 


Tyr 


Lys 


His 


Gin Arg 


Ala 


Tyr 


Gin 


Phe 








20 










25 










30 






Thr 


Gly 


Glu 


Asp 


Asp 


Phe 


Gly 


Lys 


Val 


Ala 


Val 


Val 


Lys 


Leu 


Pro 


Met 






35 










40 










45 








Asp 


Leu 


Thr 


Lys 


Val 


Gly 


He 


He 


Val 


Arg 


Leu 


Asn 


Glu 


Trp 


Gin 


Ala 




50 










55 










60 










Lys 


Asp 


Val 


Ala 


Lys 


Asp 


Arg 


Phe 


He 


Glu 


lie 


Lys 


Asp Gly 


Lys 


Ala 


65 










70 










75 










80 


Glu 


Val 


Trp 


He 


Leu 


Gin 


Gly 


Val 


Glu 


Glu 


He 


Phe 


Tyr 


Glu 


Lys 


Pro 










85 










90 










95 




Asp 


Thr 


Ser 


Pro 


Arg 


He 


Phe 


Phe 


Ala 


Gin 


Ala 


Arg 


Ser 


Asn 


Lys 


Val 






100 










105 










110 






lie 


Glu 


Ala 


Phe 


Leu 


Thr 


Asn 


Pro 


Val 


Asp 


Thr 


Lys 


Lys 


Lys 


Glu 


Leu 






115 










120 










125 








Phe 


Lys 


Val 


Thr 


val 


Asp 


Gly 


Lys 


Glu 


He 


Pro 


Val 


Ser 


Arg 


Val 


Glu 




130 










135 










140 










Lys 


Ala 


Asp 


Pro 


Thr 


Asp 


He 


Asp 


Val 


Thr 


Asn 


Tyr 


Val 


Arg 


He 


Val 


145 










150 










155 










160 


Leu 


Ser 


Glu 


Ser 


Leu 


Lys 


Glu 


Glu 


Asp 


Leu 


Arg 


Lys 


Asp 


Val 


Glu 


Leu 










165 










170 










175 




He 


He 


Glu 


Gly 


Tyr 


Lys 


Pro 


Ala 


Arg 


Val 


He 


Met 


Met 


Glu 


lie 


Leu 








180 










185 










190 






Asp 


Asp 


Tyr 


Tyr 


Tyr 


Asp 


Gly 


Glu 


Leu 


Gly 


Ala 


Val 


Tyr 


Ser 


Pro 


Glu 






195 










200 










205 








Lys 


Thr 


He 


Phe 


Arg 


Val 


Trp 


Ser 


Pro 


Val 


Ser 


Lys 


Trp 


Val 


Lys 


Val 


210 










215 










220 










Leu 


Leu 


Phe 


Lys 


Asn 


Gly 


Glu 


Asp 


Thr 


Glu 


Pro 


Tyr 


Gin 


Val 


Val 


Asn 


225 










230 










235 










240 


Met 


Glu 


Tyr 


Lys 


Gly 


Asn 


Gly 


Val 


Trp 


Glu 


Ala 


Val 


Val 


Glu Gly 


Asp 










245 










250 










255 
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Leu 


Asp 


Gly 


Val 








260 




Th r 


Th t" 
1 1 1 X 








*i *7 Q 
£ / b 




Glu 


Ser 


Ala 

Ml a 


Val 

V U J. 




290 






Asn 


Asp 


Arg 


Pi 1/ 
o i y 


305 








Glu 


lie 


n x o 


Tie 

IXC 


Asn 


Lys 


Gly 


Leu 








340 


Gly Gly 


Val 


L 11 X 






c c 

j b b 




His 


Val 


nib 


T 1 P 
X X c 




370 






Asp 


Lys 


MSp 


n V\ /a 


385 








Phe 


Met 


Va 1 
v a x 


Pro 


Thr 


Arg 


He 


Arg 








420 


Gly 


lie 


P* 1 \r 

Giy 


Val 






A ~i P 

4 3 5 




Gly 


Glu 


Leu 


O a V 

be r 




450 






lie 


Asp 


by s 


1 n x 


465 








lie 


Ala 


Cot* 


U1U 


Thr 


Tyr 


Trp 


val 








500 


Met 


Gly 


Leu 


lie 






bib 




His 


Lys 


lie 


A e »^ 
ASp 




530 






Trp Gly 


a 1 a 
Ala 


f ro 


545 








Val 


Ala 


Ala 


Phe 

rue 


Phe 


Asn 


Pro 


Ser 








580 


Thr 


Lys 


T 1 P 


lj y 9 






D V 0 




Leu 


He 


by s 


C o v 
OCX 




610 






Ala 


Cys 


His 


Asp 


625 








Lys 


Ala 


Asp 


Lys 


Gin 


Lys 


Leu 


Ala 








660 


Leu 


His 


Gly 


Gly 



675 



Phe 

rue 


iyr 


T iPi l 


Tvr 
i y x 


Asp 


Pro 


Tyr 


Ser 








280 


Ua 1 
Vol 


A c n 
no 11 


J-icU 


Al a 
M x a 






C 7 J 




fro 


L»y s 


Tl fl 

lie 


ol u 




J 1 u 






Ala 

M X CI 


Aon 
Mo \J 


Tip 

11C 


Thr* 


^ o c 
j <: j 








iyr 


I .P 1 1 
uC U 


Glv 


T iP it 

JJC u 


Thr 


Gly 


Leu 


Ser 








360 


T rtn 

L*eU 


O y r\ 


rfl e 








1 -J c 

J / b 




ol,, 
blU 


xjy b 


iyr 


T\/ r 

iyr 










UIU 


uiy 


A rn 
mx y 


i y x 


4 05 








ol U 


va i 


jjy s 




He 


Met 


ASp 


Met 








440 


Ala 

ai a 


pne 


ASp 








4 bb 




nl «r 

Gly 


ai a 


Tyr 


Leu 




4 7 0 






A v~ n 

Arg 




rlc u 


Mph 


4 85 








uy s 


pi ., 
ulU 


iyr 


nib 


Asp 


Lys 


Lys 


Thr 








520 


rlO 


Th r 
lilt 


Tip 
x i e 


Tip 
lie 






b J b 




T 1 0 

i ie 


Arg 


rne 


uiy 




bb v 






ASM 


A C T\ 

ASp 


ulU 




c a c; 








Va 1 

v a x 




Glv 
ox y 


Phe 


Arg 


Gly 


Val 


Val 








600 




Ala 
mx a 


T iP 1 1 








615 




Asn 


His 


Thr 


Leu 




630 






Lys 


Lys 


Glu 


Trp 


645 








Gly 


Ala 


lie 


Leu 


Gin 


Asp 


Phe 


Cys 



680 



Gin 


Leu 


Glu 

W 1 u 


Asn 


? 5 

4. O D 








T.\/Q 


Ala 


Val 
v a i 


i yx 


Arg 


Thr 


Asn 


Pro 








300 


Glv 
vj x y 


Tvr 
iyr 


Glu 


rta jj 






J X J 




Glv 


Leu 

■u v 


Glu 


Asn 




J J V 






Th r 

1 1 1 X 


Glu 


Glu 


Asn 

noil 


j 4 D 








nig 


I.pn 
UC u 


Val 

vox 


Glu 

\J 1 u 


Asp 


Phe 


Tyr 


Thr 








380 


Ann 

ho n 


i rp 


Glv 
u i y 


Tvr 

iyr 






J?3 




Car 
o c x 


Th f 

1 i 1 X 


Son 


Prn 

t X w 




4 1 U 






rIC L 


Va 1 
v a i 


xjy o 


Ala 

n i a 


/IOC 

4 2 b 








Va 1 
veil 


Php 
r tie 




His 

U X o 


Thr 


Val 


Pro 


Tyr 








460 


A e r> 
A 5 n 


m ii 

UlU 


oex 


Glv 
vj x y 






1 / D 




Arg 


T ,VR 

u y o 


Phe 

rue 


He 

1 1 w 










Tip 
x x e 


ion 


Glv 
oxy 


Phe 


bUD 








Mpf 
l ie u 


Lpu 

i-J ^— u 


Glu 


Val 


Leu 


Tyr 


Gly 


Glu 








540 


L»y s 


C a v 
OCX 


Jen 
MSp 


Va 1 
val 






c. c 
j j j 




mi y 




Ala 


He 




CTft 
3 f v 






Va 1 
v a x 


ric c 


Glv 
oiy 


Glv 
oxy 










Glv 
o x y 


^pr 


lie 


Asn 

f* Oil 


Pro 

» 1 V 


Glu 


Glu 

w * V* 


Thr 








620 


Trp 


Asp 


Lys 


Asn 






635 




Thr 


Glu 


Glu 


Glu 




650 






Leu 


Thr 


Ser 


Gin 


665 








Arg 


Thr 


Thr 


Asn 



205 



Tvr 

i. y i 


Glv 


Lvs 


He 




270 






Ala 


Asn 


Asn 


Gin 


285 

A V «J 








Glu 


Glv 

vji y 


Trio 


Glu 


Ala 


He 


lie 


Tyr 








320 


Ser 


Glv 

\jiy 


Val 


Lvs 






J J J 




Thr 

A I IX 


uy 3 


Ala 


Pro 




J J V 






Lpu 


Glv 
oxy 


Val 

V d X 


Thr 


J O J 








Glv 

oxy 


Asd 


Glu 


Leu 


Asp 


Pro 


Tyr 


Leu 








400 


Lvs 


Asn 


Pro 


His 






"13 




Iqi I 
L/C U 


n i o 


TiVS 
xj y o 


His 

111 o 




4"ift 






Thr 

i 1 1 X 


Tvr 
i y t 


Glv 
oiy 


He 


4 d Q 
4 ^ D 








Tvr 

iyr 


Php 
cue 


Tvr 
i yi 


Ara 

/\x y 


Cys 


Gly 


Asn 


Val 








480 


Va 1 
v a i 


Asn 


Thr 

X 1 1 1 


Val 

vol 










Ara 


Phe 


Aso 


Gin 

JX> A « 




^ i n 

31v 






Glu 


Ara 


Ala 


Leu 


D £ D 








Pro 


TrD 


Glv 

\ji y 


Glv 


Ala 


Gly 


Thr 


His 








560 




Glv 

oxy 


Ser 


Val 






575 




Tvr 
i y l 


Glv 

oi y 


Lvs 


Glu 




590 

J J w 






Tvr 

x y i 


Asd 


Glv 


LVS 

"7 


605 








He 


Asn 


Tvr 

x y i 


Ala 


Tyr 


Leu 


Ala 


Ala 








640 


Leu 


Lys 


Asn 


Ala 






655 




Gly 


Val 


Pro 


Phe 




670 






Phe 


Asn 


Asp 


Asn 



685 
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Ser 


Tvr 


Asn Ala 


Pro 


He 


Ser 


He 


Asn 


Gly 


Phe 


Asp 


Tyr 


Glu 


Arg 


Lys 




& q n 

V 7 V 








695 










700 










Leu 


Gin 


Phe He 


Asp 


Val 


Phe 


Asn 


Tyr 


His 


Lys 


Gly 


Leu 


He 


Lys 


Leu 


705 








710 










715 










720 


Ara 


IjV e 


Gl u His 


Pro 


Ala 


Phe 


Arg 


Leu 


Lvs 


Asn 


Ala 


Glu 


Glu 


lie 


Lys 








7 2 5 










73 0 










735 




Lvs 


Hie 


Leu Glu 


Phe 


Lpvj 

X-J V*- KM 


Pro Gly 


Gly 


Arg 


Arq 


He 

* * Xk» 


Val 


Ala 


Phe 


Met 






74 0 










745 










750 






Leu 

±rf w w 


Lvs 


Aso His 


Ala 


Glv 


Gly 


Asp 


Pro 


Trp 


Lys 


ASD 


He 


Val 


Val 


He 






755 








760 










765 








Tyr 


Asn 


Gly Asn 


Leu 


Glu 


Lys 


Thr 


Thr 


Tyr 


Lys 


Leu 


Pro 


Glu 


Gly 


Lys 




770 








775 










780 










Trp 


Asn 


Val Val 


Val 


Asn 


Ser 


Gin 


Lys 


Ala 


Gly 


Thr 


Glu 


Val 


lie 


Glu 


785 








790 










795 










800 


Thr 


Val 


Glu Gly 


Thr 


He 


Glu 


Leu 


Asp 


Pro 


Leu 


Ser 


Ala 


Tyr 


Val 


Leu 








805 










810 










815 




Tyr 


Arg 


Glu Ser 


Glu 


Lys 


Asp 


Glu 


Leu 

















820 825 



<210> 35 

<211> 460 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<400> 35 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


1 








5 








Ala 


Thr 


Ser 


Ala 


Lys 


Tyr 


Leu 


Glu 








20 










Gin 


Ala 


Phe 


Tyr 


Trp 


Asp 


Val 


Pro 






35 










40 


lie 


Arg 


Gin 


Lys 


He 


Pro 


Glu 


Trp 




50 










55 




Trp 


He 


Pro 


Pro 


Ala 


Ser 


Lys 


Gly 


65 










70 






Tyr 


Asp 


Pro 


Tyr 


Asp 


Tyr 


Phe 


Asp 










85 








Thr 


Val 


Glu 


Thr 


Arg 


Phe 


Gly Ser 








100 










Asn 


Thr 


Ala 


His 


Ala 


Tyr 


Gly 


lie 






115 










120 


Asn 


His 


Arg 


Ala 


Gly 


Gly 


Asp 


Leu 




130 










135 




Tyr 


Thr 


Trp 


Thr 


Asp 


Phe 


Ser 


Lys 


145 










150 






Asn 


Tyr 


Leu 


Asp 


Phe 


His 


Pro 


Asn 










165 








Thr 


Phe 


Gly 


Gly 


Tyr 


Pro 


Asp 


He 








180 










Tyr 


Trp 


Leu 


Trp 


Ala 


Ser 


Gin 


Glu 



Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 




10 










15 




Leu 


Glu 


Glu 


Gly 


Gly 


Val 


lie 


Met 


25 










30 






Ser 


Gly 


Gly 


lie 


Trp 


Trp 


Asp 


Thr 










45 








Tyr 


Asp 


Ala 


Gly 


lie 


Ser 


Ala 


He 








60 










Met 


Ser 


Gly 


Gly 


Tyr 


Ser 


Met 


Gly 






75 










80 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys 


Gly 




90 










95 




Lys 


Gin 


Glu 


Leu 


lie 


Asn 


Met 


He 


105 










110 






Lys 


Val 


lie 


Ala 


Asp 


lie 


Val 


lie 










125 








Glu 


Trp 


Asn 


Pro 


Phe 


Val 


Gly 


Asp 








140 










Val 


Ala 


Ser 


Gly 


Lys 


Tyr 


Thr 


Ala 






155 










160 


Glu 


Leu 


His 


Ala 


Gly 


Asp 


Ser 


Gly 




170 










175 




Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 


185 










190 






Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 
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195 










200 


He 


Gly 


He 


Asp 


Ala 


Trp 

* tr 


Arg 


Phe 




210 










215 




Trp 


Val 


Val 


Lys 


Asp 


Trp 


Leu 


Asn 


225 










230 






Glu 


Tyr 


Trp 


Asp 


Thr 


Asn 


Val 


Asp 










245 








Ser Gly 


Ala 


Lys 


Val 


Phe 


Asp 


Phe 








260 










Ala 


Phe 


Asp 


Asn 


Lys 


Asn 


lie 


Pro 






275 










2 80 


Gly 


Gly 

4 


Thr 


Val 


Va 1 


Ser 


Arg 


Asp 




290 










295 




Ala 


Asn 


His 


Asp 


Thr 


Asp 


lie 


He 


305 










310 






Phe 


He 


Leu 


Thr 


Tvr 


Glu 


Gly 


Gin 










325 








Glu 


Glu 


Trp 


Leu 


Asn 


Lys 


Asp 


Lys 








340 










Asp 


Asn 


Leu 


Ala 


Gly 
* 


Gly 
* 


Ser 


Thr 






355 










360 


Glu 


Met 


He 


Phe 


Val 


Ara 


Asn 


Gly 




370 










375 




Thr 


Tyr 


He 


Asn 


Leu 


Gly 


Ser 


Ser 


385 










-ion 






Pro 


Lys 


Phe 


Ala 


Gly 


Ala 


Cys 


He 










405 








Gly 


Trp 


Val 


Asp 


Lys 


Tyr 


Val 


Tyr 








420 










Ala 


Pro 


Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 






435 










440 


Ser 


Tyr 


Cys 


Gly Val 


Gly 


Ser 


Glu 




450 










455 











205 






Asp Tvr 


Val 


Lys 


Glv Tvr 


Gly Ala 






220 








Trp Trp 


Gly Gly 


Trp Ala 


Val 


Gly 




235 








240 


Ala Leu 


Leu 


Asn 


Trp Ala 


Tyr 


Ser 


250 








255 




Pro Leu 


Tyr 


Tyr 


Lys Met 


Asp 


Ala 


265 






270 






Ala Leu 


Val 


Glu 


Ala Leu 


Lys 


Asn 








285 

\j 






Pro Phe 


Lys 


Ala 


Val Thr 


Phe 


Val 






300 








Trp Asn 


Lys 


Tyr 


Pro Ala 


Tyr 


Ala 




315 








320 


Pro Thr 


lie 


Phe 


Tvr Arq 


Asp 


Tyr 


330 








335 




Leu Lys 


Asn 


Leu 


He Trp 


lie 


His 


345 






350 






Ser He 


Val 


Tyr 


Tvr Aso 


Ser 


Asp 








365 

J V -J 






Tyr Gly Ser 


Lys 


Pro Gly 


Leu 


He 






380 








Lys Val 


Gly Arg 


Trp Val 


Tyr 


Val 




395 








400 


His Glu 


Tyr 


Thr 


Gly Asn 


Leu 


Gly 


410 








415 




Ser Ser 


Gly 


Trp 


Val Tyr 


Leu 


Glu 


425 






430 






Gly Gin 


Tyr 


Gly 


Tyr Ser 


Val 


Trp 








445 






Lys Asp 


Glu 


Leu 









460 



<210> 36 

<2U> 718 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 36 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu Ala 


Ala 


Ser 


1 








5 










10 








15 




Ala 


Thr 


Ser 


Met 


Glu 


Thr 


He 


Lys 


lie 


Tyr 


Glu 


Asn 


Lys Gly 


Val 


Tyr 








20 










25 








30 






Lys 


Val 


Val 


lie 


Gly 


Glu 


Pro 


Phe 


Pro 


Pro 


He 


Glu 


Phe Pro 


Leu 


Glu 






35 










40 










45 






Gin 


Lys 


He 


Ser 


Ser 


Asn 


Lys 


Ser 


Leu 


Ser 


Glu 


Leu 


Gly Leu 


Thr 


lie 




50 










55 










60 








Val 


Gin 


Gin 


Gly 


Asn 


Lys 


val 


He 


Val 


Glu 


Lys 


Ser 


Leu Asp 


Leu 


Lys 


65 










70 










75 








80 
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Glu His lie He Gly Leu Gly Glu Lys Ala Phe Glu Leu Asp Arg Lys 

85 90 95 

Arg Lys Arg Tyr Val Met Tyr Asn Val Asp Ala Gly Ala Tyr Lys Lys 

100 105 no 

Tyr Gin Asp Pro Leu Tyr Val Ser lie Pro Leu Phe He Ser Val Lys 

115 120 125 

Asp Gly Val Ala Thr Gly Tyr Phe Phe Asn Ser Ala Ser Lys Val He 

130 135 140 

Phe Asp Val Gly Leu Glu Glu Tyr Asp Lys Val He Val Thr He Pro 
145 150 155 160 

Glu Asp Ser Val Glu Phe Tyr Val He Glu Gly Pro Arg He Glu Asp 

165 170 175 

Val Leu Glu Lys Tyr Thr Glu Leu Thr Gly Lys Pro Phe Leu Pro Pro 

180 185 190 

Met Trp Ala Phe Gly Tyr Met He Ser Arg Tyr Ser Tyr Tyr Pro Gin 

195 200 205 

Asp Lys Val Val Glu Leu Val Asp lie Met Gin Lys Glu Gly Phe Arg 

210 215 220 

Val Ala Gly Val Phe Leu Asp He His Tyr Met Asp Ser Tyr Lys Leu 
225 230 235 240 

Phe Thr Trp His Pro Tyr Arg Phe Pro Glu Pro Lys Lys Leu He Asp 

245 250 255 

Glu Leu His Lys Arg Asn Val Lys Leu He Thr lie Val Asp His Gly 

260 265 270 

He Arg Val Asp Gin Asn Tyr Ser Pro Phe Leu Ser Gly Met Gly Lys 

275 280 285 

Phe Cys Glu lie Glu Ser Gly Glu Leu Phe Val Gly Lys Met Trp Pro 

290 295 300 

Gly Thr Thr Val Tyr Pro Asp Phe Phe Arg Glu Asp Thr Arg Glu Trp 
305 310 315 320 

Trp Ala Gly Leu lie Ser Glu Trp Leu Ser Gin Gly Val Asp Gly lie 

325 330 335 

Trp Leu Asp Met Asn Glu Pro Thr Asp Phe Ser Arg Ala lie Glu lie 

340 345 350 

Arg Asp Val Leu Ser Ser Leu Pro Val Gin Phe Arg Asp Asp Arg Leu 

355 360 365 

Val Thr Thr Phe Pro Asp Asn Val Val His Tyr Leu Arg Gly Lys Arg 

370 375 380 

Val Lys His Glu Lys Val Arg Asn Ala Tyr Pro Leu Tyr Glu Ala Met 
385 390 395 400 

Ala Thr Phe Lys Gly Phe Arg Thr Ser His Arg Asn Glu lie Phe lie 

405 410 415 

Leu Ser Arg Ala Gly Tyr Ala Gly lie Gin Arg Tyr Ala Phe lie Trp 

420 425 430 

Thr Gly Asp Asn Thr Pro Ser Trp Asp Asp Leu Lys Leu Gin Leu Gin 

435 440 445 

Leu Val Leu Gly Leu Ser lie Ser Gly Val Pro Phe Val Gly Cys Asp 

450 455 460 

lie Gly Gly Phe Gin Gly Arg Asn Phe Ala Glu lie Asp Asn Ser Met 

465 470 475 480 

Asp Leu Leu Val Lys Tyr Tyr Ala Leu Ala Leu Phe Phe Pro Phe Tyr 

485 490 495 

Arg Ser His Lys Ala Thr Asp Gly lie Asp Thr Glu Pro Val Phe Leu 

500 505 510 
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Pro Asp Tyr Tyr Lys Glu Lys Val Lys Glu lie Val Glu Leu Arg Tyr 

515 520 525 

Lys Phe Leu Pro Tyr He Tyr Ser Leu Ala Leu Glu Ala Ser Glu Lys 

530 535 540 

Gly His Pro Val He Arg Pro Leu Phe Tyr Glu Phe Gin Asp Asp Asp 
545 550 555 560 

Asp Met Tyr Arg He Glu Asp Glu Tyr Met Val Gly Lys Tyr Leu Leu 

565 570 575 

Tyr Ala Pro He Val Ser Lys Glu Glu Ser Arg Leu Val Thr Leu Pro 

580 585 590 

Arg Gly Lys Trp Tyr Asn Tyr Trp Asn Gly Glu He He Asn Gly Lys 

595 600 605 

Ser Val Val Lys Ser Thr His Glu Leu Pro lie Tyr Leu Arg Glu Gly 

610 615 620 

Ser He He Pro Leu Glu Gly Asp Glu Leu He Val Tyr Gly Glu Thr 
625 630 635 640 

Ser Phe Lys Arg Tyr Asp Asn Ala Glu He Thr Ser Ser Ser Asn Glu 

645 650 655 

He Lys Phe Ser Arg Glu He Tyr Val Ser Lys Leu Thr He Thr Ser 

660 665 670 

Glu Lys Pro Val Ser Lys He He Val Asp Asp Ser Lys Glu He Gin 

675 680 685 

Val Glu Lys Thr Met Gin Asn Thr Tyr Val Ala Lys He Asn Gin Lys 

690 695 700 

He Arg Gly Lys He Asn Leu Glu Ser Glu Lys Asp Glu Leu 
705 710 715 



<210> 37 
<211> 1434 
<212> DMA 

<213> Thermotoga maritima 
<400> 37 

atgaaagaaa ccgctgctgc taaattcgaa cgccagcaca tggacagccc agatctgggt 60 

accctggtgc cacgcggttc catggccgag ttcttcccgg agatcccgaa gatccagttc 120 

gagggcaagg agtccaccaa cccgctcgcc ttccgcttct acgacccgaa cgaggtgatc 180 

gacggcaagc cgctcaagga ccacctcaag ttctccgtgg ccttctggca caccttcgtg 240 

aacgagggcc gcgacccgtt cggcgacccg accgccgagc gcccgtggaa ccgcttctcc 300 

gacccgatgg acaaggcctt cgcccgcgtg gacgccctct tcgagttctg cgagaagctc 360 

aacatcgagt acttctgctt ccacgaccgc gacatcgccc cggagggcaa gaccctccgc 420 

gagaccaaca agatcctcga caaggtggtg gagcgcatca aggagcgcat gaaggactcc 480 

aacgtgaagc tcctctgggg caccgccaac ctcttctccc acccgcgcta catgcacggc 540 

gccgccacca cctgctccgc cgacgtgttc gcctacgccg ccgcccaggt gaagaaggcc 600 

ctggagatca ccaaggagct gggcggcgag ggctacgtgt tctggggcgg ccgcgagggc 660 

tacgagaccc tcctcaacac cgacctcggc ctggagctgg agaacctcgc ccgcttcctc 720 

cgcatggccg tggagtacgc caagaagatc ggcttcaccg gccagttcct catcgagccg 780 

aagccgaagg agccgaccaa gcaccagtac gacttcgacg tggccaccgc ctacgccttc 840 

ctcaagaacc acggcctcga cgagtacttc aagttcaaca tcgaggccaa ccacgccacc 900 

ctcgccggcc acaccttcca gcacgagctg cgcatggccc gcatcctcgg caagctcggc 960 

cccatcgacg ccaaccaggg cgacctcctc ctcggctggg acaccgacca gttcccgacc 1020 

aacatctacg acaccaccct cgccatgtac gaggtgatca aggccggcgg cttcaccaag 1080 

ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct acaaggtgga ggacctcttc 1140 

atcggccaca tcgccggcat ggacaccttc gccctcggct tcaagatcgc ctacaagctc 1200 
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gccaaggacg gcgtgttcga 
atcggcaagg agatcgtgga 
gacaaggagg acatcgagct 
tcctacatcg tgaagaccat 



caagttcatc gaggagaagt 
gggcaagacc gacttcgaga 
gccgtccggc aagcaggagt 
cgccgagctg cgctccgaga 



accgctcctt caaggagggc 1260 
agctggagga gtacatcatc 1320 
acctggagtc cctcctcaac 1380 
aggacgagct gtga 14 34 



<210> 38 

<211> 477 

<212> PRT 

<213> Thermotoga maritima 



<400> 38 
Met Lys Glu Thr 
1 

Pro Asp Leu Gly 

20 

Pro Glu lie Pro 
35 

Leu Ala Phe Arg 

50 

Leu Lys Asp His 
65 

Asn Glu Gly Arg 

Asn Arg Phe Ser 

100 

Leu Phe Glu Phe 
115 

Asp Arg Asp He 
130 

He Leu Asp Lys 
145 

Asn Val Lys Leu 

Tyr Met His Gly 

180 

Ala Ala Ala Gin 
195 

Gly Glu Gly Tyr 
210 

Leu Asn Thr Asp 
225 

Arg Met Ala Val 

Leu He Glu Pro 

260 

Asp Val Ala Thr 
275 

Tyr Phe Lys Phe 
290 

Thr Phe Gin His 
305 

Ser He Asp Ala 
Gin Phe Pro Thr 



Ala Ala Ala Lys 
5 

Thr Leu Val Pro 

Lys He Gin Phe 

40 

Phe Tyr Asp Pro 
55 

Leu Lys Phe Ser 
70 

Asp Pro Phe Gly 
85 

Asp Pro Met Asp 

Cys Glu Lys Leu 

120 

Ala Pro Glu Gly 
135 

Val Val Glu Arg 
150 

Leu Trp Gly Thr 
165 

Ala Ala Thr Thr 

Val Lys Lys Ala 

200 

Val Phe Trp Gly 
215 

Leu Gly Leu Glu 
230 

Glu Tyr Ala Lys 
245 

Lys Pro Lys Glu 

Ala Tyr Ala Phe 

280 

Asn He Glu Ala 
295 

Glu Leu Arg Met 
310 

Asn Gin Gly Asp 
325 

Asn He Tyr Asp 



Phe Glu Arg Gin 
10 

Arg Gly Ser Met 
25 

Glu Gly Lys Glu 

Asn Glu Val He 

60 

Val Ala Phe Trp 
75 

Asp Pro Thr Ala 
90 

Lys Ala Phe Ala 
105 

Asn He Glu Tyr 

Lys Thr Leu Arg 

140 

He Lys Glu Arg 
155 

Ala Asn Leu Phe 
170 

Cys Ser Ala Asp 
185 

Leu Glu He Thr 

Gly Arg Glu Gly 

220 

Leu Glu Asn Leu 
235 

Lys He Gly Phe 
250 

Pro Thr Lys His 
265 

Leu Lys Asn His 

Asn His Ala Thr 

300 

Ala Arg He Leu 
315 

Leu Leu Leu Gly 
330 

Thr Thr Leu Ala 
210 





Mpt- 


no £j 








15 




Mid 


VJ J. u 


Php 

r lie 


Php 
rue 












Th TT 
1111. 


Icn 
nS 11 


P T"0 


4 S 








h. e n 


\j ± y 




It L \J 


His 


Thr 


Phe 


Val 








80 


pin 




r I U 


Trn 






95 




arg 


Va 1 

vox 


no £j 


Ala 




110 

-1. .1. W 






Php 




Phe 

rue 


His 


125 








Glu 

OiU 


Thy 

1 1 I im 


r\. «— > i k 


Lvs 


Met 


Lys 


Asp 


Ser 








160 




His 


Pro 


Ara 






175 




Val 


Phe 


Ala 


Tvr 




190 






Lys 


Glu 


Leu 


Gly 


205 








Tyr 


Glu 


Thr 


Leu 


Ala 


Arg 


Phe 


Leu 








240 


Thr 


Gly 


Gin 


Phe 






255 




Gin 


Tyr 


Asp 


Phe 




270 






Gly 


Leu 


Asp 


Glu 


285 








Leu 


Ala 


Gly 


His 


Gly 


Lys 


Leu 


Gly 








320 


Trp 


Asp 


Thr 


Asp 






335 




Met 


Tyr 


Glu 


Val 
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340 



lie 


Lys 


Ala 


Gly Gly 


Phe 


Thr 


Lvs 






355 










360 


Val 


Arg 


Ara 


Ala 


Ser 


Tvr 


Lvs 


Val 




370 










375 




Ala Gly 


Met 


Asp 


Thr 


Phe 


Ala 


Leu 


385 










390 






Ala 


Lys 




Gly 


Val 


r II C 




I.wq 










405 








Phe 


Lys 


Glu 


Gly 


He 


Gly 


Lys 


Glu 








420 










Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 






435 










440 


Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 




450 










455 




Lys 


Thr 


He 


Ala 


Glu 


Leu 


Arg 


Ser 


465 










470 







345 










350 






Gly Gly 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 










365 








Glu 


Asp 


Leu 


Phe 


He 


Gly 


His 


lie 








380 










Gly 


Phe 


Lvs 


He 


Ala 


Tyr 


Lys 


Leu 






395 










400 


Phe 


He 


Glu 


Glu 


Lys 


Tyr 


Arg 


Ser 




410 










415 




He 


Val 


Glu 


Gly 


Lys 


Thr 


Asp 


Phe 


425 










430 






Asp 


Lys 


Glu 


Asp 


He 


Glu 


Leu 


Pro 










445 








Ser 


Leu 


Leu 


Asn 


Ser 


Tyr 


He 


Val 








460 










Glu 


Lys 


Asp 


Glu 


Leu 









475 



<210> 39 
<211> 1434 
<212> DNA 

<213> Thermotoga neapolitana 
<400> 39 

atgaaagaaa ccgctgctgc taaattcgaa cgccagcaca tggacagccc agatctgggt 60 

accctggtgc cacgcggttc catggccgag ttcttcccgg agatcccgaa ggtgcagttc 120 

gagggcaagg agtccaccaa cccgctcgcc ttcaagttct acgacccgga ggagatcatc 180 

gacggcaagc cgctcaagga ccacctcaag ttctccgtgg ccttctggca caccttcgtg 240 

aacgagggcc gcgacccgtt cggcgacccg accgccgacc gcccgtggaa ccgctacacc 300 

gacccgatgg acaaggcctt cgcccgcgtg gacgccctct tcgagttctg cgagaagctc 360 

aacatcgagt acttctgctt ccacgaccgc gacatcgccc cggagggcaa gaccctccgc 420 

gagaccaaca agatcctcga caaggtggtg gagcgcatca aggagcgcat gaaggactcc 480 

aacgtgaagc tcctctgggg caccgccaac ctcttctccc acccgcgcta catgcacggc 540 

gccgccacca cctgctccgc cgacgtgttc gcctacgccg ccgcccaggt gaagaaggcc 600 

ctggagatca ccaaggagct gggcggcgag ggctacgtgt tctggggcgg ccgcgagggc 660 

tacgagaccc tcctcaacac cgacctcggc ttcgagctgg agaacctcgc ccgcttcctc 720 

cgcatggccg tggactacgc caagcgcatc ggcttcaccg gccagttcct catcgagccg 780 

aagccgaagg agccgaccaa gcaccagtac gacttcgacg tggccaccgc ctacgccttc 840 

ctcaagtccc acggcctcga cgagtacttc aagttcaaca tcgaggccaa ccacgccacc 900 

ctcgccggcc acaccttcca gcacgagctg cgcatggccc gcatcctcgg caagctcggc 960 

tccatcgacg ccaaccaggg cgacctcctc ctcggctggg acaccgacca gttcccgacc 1020 

aacgtgtacg acaccaccct cgccatgtac gaggtgatca aggccggcgg cttcaccaag 1080 

ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct acaaggtgga ggacctcttc 1140 

atcggccaca tcgccggcat ggacaccttc gccctcggct tcaaggtggc ctacaagctc 1200 

gtgaaggacg gcgtgctcga caagttcatc gaggagaagt accgctcctt ccgcgagggc 1260 

atcggccgcg acatcgtgga gggcaaggtg gacttcgaga agctggagga gtacatcatc 1320 

gacaaggaga ccatcgagct gccgtccggc aagcaggagt acctggagtc cctcatcaac 1380 

tcctacatcg tgaagaccat cctggagctg cgctccgaga aggacgagct gtga 1434 

<210> 40 
<211> 477 
<212> PRT 

<213> Thermotoga neapolitana 
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<400> 40 



Met 


Lys 


Glu 


Thr 


Ala 


Ala 


Ala 


Lys 


Phe 


Glu 


Arg 


Gin 


His 


Met 


Asp 


Ser 


1 








5 










10 










15 




Pro 


Asp 


Leu 


Gly 


Thr 


Leu 


Val 


Pro 


Arg 


Gly 


Ser 


Met 


Ala 


Glu 


Phe 


Phe 








20 










25 










30 






Pro 


Glu 


He 


Pro 


Lys 


Val 


Gin 


Phe 


Glu 


Gly 


Lys 


Glu 


Ser 


Thr 


Asn 


Pro 






35 










40 










45 








Leu 


Ala 


Phe 


Lys 


Phe 


Tyr 


Asp 


Pro 


Glu 


Glu 


He 


He 


Asp Gly 


Lys 


Pro 




50 










55 










60 










Leu 


Lys 


Asp 


His 


Leu 


Lys 


Phe 


Ser 


Val 


Ala 


Phe 


Trp 


His 


Thi* 

L It i. 


Phe 


Val 


65 








70 










75 










80 


Asn 


Glu 


Gly 


Arg 


Asp 


Pro 


Phe 


Gly 


Asp 


Pro 


Thr 


Ala 


Asp 


Arg 


Pro 


Trp 










85 










90 










95 




Asn 


Arg 


Tyr 


Thr 


Asp 


Pro 


Met 


Asp 


Lys 


Ala 


Phe 


Ala 


Arg 


Val 


Asp 


Ala 








100 










105 










HO. 






Leu 


Phe 


Glu 


Phe 


Cys 


Glu 


Lys 


Leu 


Asn 


He 


Glu 


Tyr 


Phe 


Cys 


Phe 


His 






115 










120 










125 








Asp 


Arg 


Asp 


He 


Ala 


Pro 


Glu 


Gly 


Lys 


Thr 


Leu 


Arg 


Glu 


Thr 


Asn 


Lys 




130 










135 










140 










lie 


Leu 


Asp 


Lys 


Val 


val 


Glu 


Arg 


He 


Lys 


Glu 


Arg 


Met 


Lys 


Asp 


Ser 


145 








150 










155 










160 


Asn 


Val 


Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


Ala 


Asn 


Leu 


Phe 


Ser 


His 


Pro 


Arg 








165 
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Met 
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Gly 
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Ala 
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Cys 
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Ala 
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Val 
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Ala 
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190 






Ala 


Ala 
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Lys 
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Glu 


Leu 


Gly 
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Trp 


Gly 
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Glu 


Gly Tyr Glu Thr 
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Gly Trp Asp Thr 


Asp 








325 
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Val 
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Ala 


Gly 
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Lys 


Gly 
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385 
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Lys 


Tyr 


Arg 


Ser 








405 










410 










415 
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Phe Arg Glu Gly He Gly Arg Asp He Val Glu Gly Lys Val Asp Phe 

420 425 430 

Glu Lys Leu Glu Glu Tyr He He Asp Lys Glu Thr lie Glu Leu Pro 

435 440 445 

Ser Gly Lys Gin Glu Tyr Leu Glu Ser Leu He Asn Ser Tyr He Val 

450 455 460 

Lys Thr He Leu Glu Leu Arg Ser Glu Lys Asp Glu Leu 
465 470 475 



<210> 41 
<211> 1435 
<212> DNA 

<213> Thermotoga maritima 



<400> 41 

atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 
atggctagca tgactggtgg acagcaaatg ggtcggatcc ccatggccga gttcttcccg 120 
gagatcccga agatccagtt cgagggcaag gagtccacca acccgctcgc cttccgcttc 180 
cacgacccga acgaggtgat cgacggcaag ccgctcaagg accacctcaa gttctccgtg 240 
gccttctggc acaccttcgt gaacgagggc cgcgacccgt tcggcgaccc gaccgccgag 300 
cgcccgtgga accgcttctc cgacccgatg gacaaggcct tcgcccgcgt ggacgccctc 360 
ttcgagttct gcgagaagct caacatcgag tacttctgct tccacgaccg cgacatcccc 420 
cggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagcgcatca 480 
aggagcgcat gaaggactcc aacgtgaagc tcctctgggg caccgccaac ctcttctccc 540 
acccgcgcta catgcacggc gccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 
ccgcccaggt gaagaaggcc ctggagatca ccaaggagct gggcggcgag ggctacgtgt 660 
tctggggcgg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ctggagctgg 720 
agaacctcgc ccgcttcctc cgcatggccg tggagtacgc caagaagatc ggcttcaccg 780 
gccagttcct catcgagccg aagccgaagg agccgaccaa gcaccagtac gcttcgacgt 840 
ggccaccgcc tacgccttcc tcaagaacca cggcctcgac gagtacttca agttcaacat 900 
cgaggccaac cacgccaccc tcgccggcca caccttccag cacgagctgc gcatggcccg 960 
catcctcggc aagctcggct ccatcgacgc caaccagggc gacctcctcc tcggctggga 1020 
caccgaccag ttcccgacca acatctacga caccaccctc gccatgtacg aggtgatcaa 1080 
ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc aaggtgcgcc gcgcctccta 1140 
caaggtggag gacctcttca tcggccacat cgccggcatg gacaccttcg ccctcggctt 1200 
caagatcgcc tacaagctcg ccaaggacgg cgtgttcgac aagttcatcg aggagaagta 1260 
ccgctccttc aaggagggca tcggcaagga gatcgtggag ggcaagaccg acttcgagaa 13 20 
gctggaggag tacatcatcg acaaggagga catcgagctg ccgtccggca agcaggagta 13 80 
cctggagtcc ctcctcaact cctacatcgt gaagaccatc gccgagctgc gctga 14 35 

<210> 42 
<211> 478 
<212> PRT 

<213> Thermotoga maritima 
<400> 42 

Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

15 10 15 

Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

20 25 30 

lie Pro Met Ala Glu Phe Phe Pro Glu He Pro Lys He Gin Phe Glu 

35 40 45 

Gly Lys Glu Ser Thr Asn Pro Leu Ala Phe Arg Phe Tyr Asp Pro Asn 
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<210> 43 

<211> 1436 

<212> DNA 

<213> Thermotoga neapolitana 



<400> 43 

atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 
atggctagca tgactggtgg acagcaaatg ggtcggatcc ccatggccga gtccttcccg 120 
gagatcccga aggtgcagtt cgagggcaag gagtccacca acccgctcgc cttcaagttc 180 
tacgacccgg aggagatcat cgacggcaag ccgctcaagg accacctcaa gttctccgtg 240 
gccttctggc acaccttcgt gaacgagggc cgcgacccgt tcggcgaccc gaccgccgac 300 
cgcccgtgga accgctacac cgacccgatg gacaaggcct tcgcccgcgt ggacgccctc 360 
ttcgagttct gcgagaagct caacatcgag tacttctgct tccacgaccg cgacatcccc 420 
cggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagcgcatca 480 
aggagcgcat gaaggactcc aacgtgaagc tcctctgggg caccgccaac ctcttctccc 540 
acccgcgcta catgcacggc gccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 
ccgcccaggt gaagaaggcc ctggagatca ccaaggagct gggcggcgag ggccacgtgt 660 
tctggggcgg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ttcgagctgg 720 
agaacctcgc ccgcttcctc cgcatggccg tggactacgc caagcgcatc ggcttcaccg 780 
gccagttcct catcgagccg aagccgaagg agccgaccaa gcaccagtac gacttcgacg 840 
tggccaccgc ctacgccttc ctcaagtccc acggcctcga cgagtacttc aagttcaaca 900 
tcgaggccaa ccacgccacc ctcgccggcc acaccttcca gcacgagctg cgcatggccc 960 
gcatcctcgg caagctcggc tccatcgacg ccaaccaggg cgacctcctc ctcggctggg 1020 
acaccgacca gttcccgacc aacgtgtacg acaccaccct cgccatgtac gaggtgatca 1080 
aggccggcgg cttcaccaag ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct 1140 
acaaggtgga ggacctcttc atcggccaca tcgccggcat ggacaccttc gccctcggct 1200 
tcaaggtggc ctacaagctc gtgaaggacg gcgtgctcga caagttcatc gaggagaagt 1260 
accgctcctt ccgcgagggc atcggccgcg acatcgtgga gggcaaggtg gacttcgaga 1320 
agctggagga gtacatcatc gacaaggaga ccatcgagct gccgtccggc aagcaggagt 1380 
acctggagtc cctcatcaac tcctacatcg tgaagaccat cctggagctg cgctga 1436 



<210> 44 

<211> 478 

<212> PRT 

<213> Thermotoga neapolitana 



<400> 44 

Met Gly Ser Ser 
l 

Arg Gly Ser His 

20 

He Pro Met Ala 
35 

Gly Lys Glu Ser 
50 

Glu He He Asp 
65 

Ala Phe Trp His 

Pro Thr Ala Asp 

100 

Ala Phe Ala Arg 
115 



His His His His 
5 

Met Ala Ser Met 
Glu Phe Phe Pro 

40 

Thr Asn Pro Leu 
55 

Gly Lys Pro Leu 
70 

Thr Phe Val Asn 
85 

Arg Pro Trp Asn 

Val Asp Ala Leu 
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Glu 
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Gly Leu Val Pro 
15 

Gin Met Gly Arg 
30 

Val Gin Phe Glu 
45 

Tyr Asp Pro Glu 

Lys Phe Ser Val 

80 

Pro Phe Gly Asp 
95 

Pro Met Asp Lys 
110 

Glu Lys Leu Asn 
125 
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<210> 45 
<211> 1095 
<212> PRT 

<213> Aspergillus shirousami 
<400> 45 

Ala Thr Pro Ala Asp Trp Arg Ser Gin Ser lie Tyr Phe Leu Leu Thr 
15 10 15 
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Ser Thr Thr Ala 

25 

Thr Trp Gin Gly 

Phe Thr Ala He 

60 

Thr Ala Tyr Gly 
75 

Ser Leu Asn Glu 
90 

Ser Ala Leu His 
105 

Asn His Met Gly 

Phe Lys Pro Phe 

140 

Gin Asn Tyr Glu 
155 

Asn Thr Val Ser 
170 

Asn Glu Trp Tyr 
185 

Asp Gly Leu Arg 

Pro Gly Tyr Asn 
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Asp Val Asp Pro 
235 

Val Leu Asn Tyr 
250 

Thr Ser Gly Ser 
265 

Ser Asp Cys Pro 

Asp Asn Pro Arg 

300 

Asn Val Ala Ala 
315 

Ala Gly Gin Glu 
330 

Glu Ala Thr Trp 
345 

Leu He Ala Ser 

Thr Gly Phe Val 

380 

Thr He Ala Met 
395 

Leu Ser Asn Lys 
410 

Gly Ala Gly Tyr 
425 

Thr Thr Val Thr 
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1025 1030 1035 1040 

Pro Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Glu Ser Phe Glu Tyr 

1045 1050 1055 

Lys Phe lie Arg Val Glu Ser Asp Asp Ser Val Glu Trp Glu Ser Asp 

1060 1065 1070 

Pro Asn Arg Glu Tyr Thr Val Pro Gin Ala Cys Gly Glu Ser Thr Ala 

1075 1080 1085 

Thr Val Thr Asp Thr Trp Arg 
1090 1095 



<210> 46 

<211> 3285 

c212> DMA 

<213> Aspergillus shirousami 



<400> 46 

gccaccccgg ccgactggcg 
cgcaccgacg gctccaccac 
acctggcagg gcatcatcga 
tggatcaccc cggtgaccgc 
ggctactggc agcaggacat 
aaggccctct cctccgccct 
aaccacatgg gctacgacgg 
tcctcccagg actacttcca 
gtggaggact gctggctcgg 
gacgtggtga agaacgagtg 
gacggcctcc gcatcgacac 
aaggccgccg gcgtgtactg 
ccgtaccaga acgcgatgga 
gccttcaagt ccacctccgg 
tccgactgcc cggactccac 
ttcgcctcct acaccaacga 
aacgacggca tcccgatcat 
ccggccaacc gcgaggccac 



ctcccagtcc atctacttcc 
cgccacctgc aacaccgccg 
caagctcgac tacatccagg 
ccagctcccg cagaccaccg 
ctactccctc aacgagaact 
ccacgagcgc ggcatgtacc 
cgccggctcc tccgtggact 
cccgttctgc ttcatccaga 
cgacaacacc gtgtccctcc 
gtacgactgg gtgggctccc 
cgtgaagcac gtgcagaagg 
catcggcgag gtgctcgacg 
cggcgtgctc aactacccga 
ctcgatggac gacctctaca 
cctcctcggc accttcgtgg 
catcgccctc gccaagaacg 
ctacgccggc caggagcagc 
ctggctctcc ggctacccga 

219 



tcctcaccga ccgcttcgcc 60 
accagaagta ctgcggcggc 120 
gcatgggctt caccgccatc 180 
cctacggcga cgcctaccac 240 
acggcaccgc cgacgacctc 300 
tcatggtgga cgtggtggcc 360 
actccgtgtt caagccgttc 420 
actacgagga ccagacccag 480 
cggacctcga caccaccaag 540 
tcgtgtccaa ctactccatc 600 
acttctggcc gggctacaac 660 
tggacccggc ctacacctgc 720 
tctactaccc gctcctcaac 780 
acatgatcaa caccgtgaag 840 
agaaccacga caacccgcgc 900 
tggccgcctt catcatcctc 960 
actacgccgg cggcaacgac 1020 
ccgactccga gctgtacaag 1080 
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ctcatcgcct ccgccaacgc catccgcaac tacgccatct ccaaggacac cggcttcgtg 1140 
acctacaaga actggccgat ctacaaggac gacaccacca tcgccatgcg caagggcacc 1200 
gacggctccc agatcgtgac catcctctcc aacaagggcg cctccggcga ctcctacacc 1260 
ctctccccct ccggcgccgg ctacaccgcc ggccagcagc tcaccgaggt gatcggccgc 1320 
accaccgtga ccgtgggctc cgacggcaac gtgccggtgc cgatggccgg cggcctcccg 1380 
cgcgtgccct acccgaccga gaagctcgcc ggctccaaga tatgctcctc ctccaagccg 1440 
gccaccctcg actcctggct ctccaacgag gccaccgtgg cccgcaccgc catcctcaac 1500 
aacatcggcg ccgacggcgc ctgggtgtcc ggcgccgact ccggcatcgt ggtggcctcc 1560 
ccgtccaccg acaacccgga ctacttctac acctggaccc gcgactccgg catcgtgctc 1620 
aagaccctcg tggacctctt ccgcaacggc gacaccgacc tcctctccac catcgagcac 1680 
tacatctcct cccaggccat catccagggc gtgtccaacc cgtccggcga cctctcctcc 1740 
ggcggcctcg gcgagccgaa gttcaacgtg gacgagaccg cctacgccgg ctcctggggc 1800 
cgcccgcagc gcgacggccc ggccctccgc gccaccgcca tgatcggctt cggccagtgg 1860 
ctcctcgaca acggctacac ctccgccgcc accgagatcg tgtggccgct cgtgcgcaac 1920 
gacctctcct acgtggccca gtactggaac cagaccggct acgacctctg ggaggaggtg 1980 
aacggctcct ccttcttcac catcgccgtg cagcaccgcg ccctcgtgga gggctccgcc 2040 
ttcgccaccg ccgtgggctc ctcctgctcc tggtgcgact cccaggcccc gcagatcctc 2100 
tgctacctcc agtccttctg gaccggctcc tacatcctcg ccaacttcga ctcctcccgc 2160 
tccggcaagg acaccaacac cctcctcggc tccatccaca ccttcgaccc ggaggccggc 2220 
tgcgacgact ccaccttcca gccgtgctcc ccgcgcgccc tcgccaacca caaggaggtg 2280 
gtggactcct tccgctccat ctacaccctc aacgacggcc tctccgactc cgaggccgtg 2340 
gccgtgggcc gctacccgga ggactcctac tacaacggca acccgtggct cctctgcacc 2400 
ctcgccgccg ccgagcagct ctacgacgcc ctctaccagt gggacaagca gggctccctg 2460 
gagatcaccg acgtgtccct cgacttcttc aaggccctct actccggcgc cgccaccggc 2520 
acctactcct cctcctcctc cacctactcc tccatcgtgt ccgccgtgaa gaccttcgcc 2580 
gacggcttcg tgtccatcgt ggagacccac gccgcctcca acggctccct ctccgagcag 2640 
ttcgacaagt ccgacggcga cgagctgtcc gcccgcgacc tcacctggtc ctacgccgcc 2700 
ctcctcaccg ccaacaaccg ccgcaactcc gtggtgccgc cgtcctgggg cgagacctcc 2760 
gcctcctccg tgccgggcac ctgcgccgcc acctccgcct ccggcaccta ctcctccgtg 2820 
accgtgacct cctggccgtc catcgtggcc accggcggca ccaccaccac cgccaccacc 2880 
accggctccg gcggcgtgac ctccacctcc aagaccacca ccaccgcctc caagacctcc 2940 
accaccacct cctccacctc ctgcaccacc ccgaccgccg tggccgtgac cttcgacctc 3000 
accgccacca ccacctacgg cgagaacatc tacctcgtgg gctccatctc ccagctcggc 3060 
gactgggaga cctccgacgg catcgccctc tccgccgaca agtacacctc ctccaacccg 3120 
ccgtggtacg tgaccgtgac cctcccggcc ggcgagtcct tcgagtacaa gttcatccgc 3180 
gtggagtccg acgactccgt ggagtgggag tccgacccga accgcgagta caccgtgccg 3240 
caggcctgcg gcgagtccac cgccaccgtg accgacacct ggcgc 32 85 

<210> 47 
<211> 679 
<212> PRT 

<213> Thermoanaerobacterium thermosaccharolyticum 

<400> 47 

Val Leu Ser Gly Cys Ser Asn Asn Val Ser Ser lie Lys He Asp Arg 

15 10 15 

Phe Asn Asn He Ser Ala Val Asn Gly Pro Gly Glu Glu Asp Thr Trp 

20 25 30 

Ala Ser Ala Gin Lys Gin Gly Val Gly Thr Ala Asn Asn Tyr Val Ser 

35 40 45 

Arg Val Trp Phe Thr Leu Ala Asn Gly Ala He Ser Glu Val Tyr Tyr 

50 55 60 

Pro Thr He Asp Thr Ala Asp Val Lys Glu He Lys Phe He Val Thr 
65 70 75 80 
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He 


Asp 


Asn 


Leu 


Thr 










485 










Gly 


Asn 


Gly 


Gin 


Tyr 


Tyr 


lie 


Arg 


He 








500 










505 
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Thr 


Lys 


Asp 


Ala 


He 


Ser 


Lys 


90 










95 




Gly 


Tyr 


Lys 


Leu 


Val 


Asn 


Thr 










110 






Lys 


Glu 


lie 


Phe 


Thr 


Asp 


Val 








125 








Lys 


Phe 


Glu 


Ala 


Leu 


Glu 


Gly 






140 










Ala 


Tyr 


Asp 


Pro 


His 


lie 


Lys 




155 










160 


Val 


He 


Lys 


Ala 


Asn 


Asn 


Asn 


170 










175 




Val 


Tyr 


Thr 


Ala 


Leu 


Ser 


Ser 










190 






Gly 


Tyr 


Tyr 

4 


Lys 


Val 


Asn 


Asp 

*> 








205 








Gin 


Met 


Thr 


Lys 


His 


Tyr 


Asp 






220 










Ala 


Glu 


He 


Asp 


Leu 


Thr 


Lys 




235 










240 


Phe 


Gly 


Gly 


Ser 


Asp 


Ser 


Glu 


250 










255 




Gly 


Glu 


Asp 


Tyr 

4 


Asn 


Asn 


Leu 










270 






Lys 


Tvr 


Cys 


Asn 


Thr 


Leu 


Asn 








285 








Tyr 


Tyr 


Asn 


Ser 


Met 


Met 


He 






300 










Lys 


Gly 


Ala 


Tyr 
* 


He 


Ala 


Ser 




315 










320 


Arq 


Asp 


Asp 


Asn 


Thr 


Gly 


Gly 


330 










335 




Tyr 


His 


Val 


Ala 


Asn 


Ala 


Phe 








350 






Asn 


Arg 


Ser 


Leu 


Asp 

*> 


Tyr 


Leu 








365 








He 


Pro 


Gin 


Asn 


Thr 


Trp 


lie 






380 










Gin 


Leu 


Asp 


Glu 


Gin 


Ala 


Asp 




395 










400 


Arg 


Tyr 


Asp 


Leu 


Tyr 


Asp 


Ser 


410 










415 




He 


Lys 

* 


He 


Gly 


Pro 


Lys 


Thr 










430 






Gly 


Tyr 


Ser 


Pro 


Ala 


Thr 


Met 








445 








Ala 


Ala 


Tyr 


lie 


Ala 


Glu 


Gin 






460 










Tyr 


Gin 


Glu 


Lys 


Ala 


Asp 


Asn 




475 










480 


Tyr 


Thr 


Glu 


Asn 


Gly 


Pro 


Leu 


490 










495 




Ala 


Gly 


Leu 


Ser 


Asp 


Pro 


Asn 



510 
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Ala Asp Phe Met 
515 

Lys Glu He Val 
530 

Lys Ser Ala Asp 
545 

Ser Thr lie Lys 

Asn His Asp Gly 

580 

Ala Gly Lys Gly 
595 

Tyr Glu lie Ala 
610 

Glu Lys Phe Ala 
625 

Asp Thr Gly Leu 

Ala Glu Tyr Val 

660 

Asp Met Pro Asp 
675 



lie 


Asn 


He 


Ala 








520 


Asp 


Pro 


Ser 


Phe 






535 




Asp 


Pro 


Lys 


He 




550 

W V 






Val 


ASD 


Thr 


Pro 


565 








Tvr 


Gly 


Glu 


Pro 


Arg 


Leu 


Trp 


Pro 








600 


Ala 


Glv 


Lvs 


ASD 






615 




Asn 


Glu 


Gly 


Gly 




630 






Pro 


Thr 


Asp 


Ser 


645 








He 


Leu 


Phe 


Ala 


He 


Val 


Tyr 





Asn Gly Gly Gly 

Leu Glu Leu Val 

540 

Leu Asn Thr Leu 
555 

Lys Gly Pro Ser 
570 

Ser Lys Thr Glu 
585 

Leu Leu Thr Gly 

Ala Thr Pro Tyr 

620 

He He Ser Glu 
635 

Ala Ser Pro Leu 
650 

Ser Asn He Glu 
665 



val 


Tyr 


Asp 


Gin 


525 








Arg 


Leu 


Gly 


Val 


Lys 


Val 


Val 


Asp 








560 


Trp 


Tyr 


Arg 


Tyr 






575 




Leu 


Tyr 


His 


Gly 




590 






Glu 


Arg 


Gly 


Met 


O V D 








val 


Lys 


Ala 


Met 


Gin 


val 


Trp 


Glu 








640 


Asn 


Trp 


Ala 


His 






655 




His 


Lys 


Val 


Leu 




670 







<210> 46 
<211> 2037 
<212> DNA 

<213> Thermoanaerobacterium thermosaccharolyt icum 
<220> 

<22 3 > synthetic 
<400> 48 

gtgctctccg gctgctccaa caacgtgtcc tccatcaaga tcgaccgctt caacaacatc 60 

tccgccgtga acggcccggg cgaggaggac acctgggcct ccgcccagaa gcagggcgtg 120 

ggcaccgcca acaactacgt gtcccgcgtg tggttcaccc tcgccaacgg cgccatctcc 180 

gaggtgtact acccgaccat cgacaccgcc gacgtgaagg agatcaagtt catcgtgacc 240 

gacggcaagt ccttcgtgtc cgacgagacc aaggacgcca tctccaaggt ggagaagttc 300 

accgacaagt ccctcggcta caagctcgtg aacaccgaca agaagggccg ctaccgcatc 360 

accaaggaaa tcttcaccga cgtgaagcgc aactccctca tcatgaaggc caagttcgag 420 

gccctcgagg gctccatcca cgactacaag ctctacctcg cctacgaccc gcacatcaag 480 

aaccagggct cctacaacga gggctacgtg atcaaggcca acaacaacga gatgctcatg 540 

gccaagcgcg acaacgtgta caccgccctc tcctccaaca tcggctggaa gggctactcc 600 

atcggctact acaaggtgaa cgacatcatg accgacctcg acgagaacaa gcagatgacc 660 

aagcactacg actccgcccg cggcaacatc atcgagggcg ccgagatcga cctcaccaag 720 

aactccgagt tcgagatcgt gctctccttc ggcggctccg actccgaggc cgccaagacc 780 

gccctcgaga ccctcggcga ggactacaac aacctcaaga acaactacat cgacgagtgg 840 

accaagtact gcaacaccct caacaacttc aacggcaagg ccaactccct ctactacaac 900 

cccatgatga tcctcaaggc ctccgaggac aagaccaaca agggcgccta catcgcctcc 960 

ctctccatcc cgtggggcga cggccagcgc gacgacaaca ccggcggcta ccacctcgtg 1020 

tggtcccgcg acctctacca cgtggccaac gccttcatcg ccgccggcga cgtggactcc 1080 

gccaaccgct ccctcgacta cctcgccaag gtggtgaagg acaacggcat gatcccgcag 1140 

aacacctgga tctccggcaa gccgtactgg acctccatcc agctcgacga gcaggccgac 1200 

ccgatcatcc tctcctaccg cctcaagcgc tacgacctct acgactccct cgtgaagccg 1260 
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ctcgccgact tcatcatcaa gatcggcccg aagaccggcc aggagcgctg ggaggagatc 1320 

ggcggctact ccccggccac gatggccgcc gaggtggccg gcctcacctg cgccgcctac 1380 

atcgccgagc agaacaagga ctacgagtcc gcccagaagt accaggagaa ggccgacaac 1440 

tggcagaagc tcatcgacaa cctcacctac accgagaacg gcccgctcgg caacggccag 1500 

tactacatcc gcatcgccgg cctctccgac ccgaacgccg acttcatgat caacatcgcc 1560 

aacggcggcg gcgtgtacga ccagaaggag atcgtggacc cgtccttcct cgagctggtg 1620 

cgcctcggcg tgaagtccgc cgacgacccg aagatcctca acaccctcaa ggtggtggac 1680 

tccaccatca aggtggacac cccgaagggc ccgtcctggt atcgctacaa ccacgacggc 1740 

tacggcgagc cgtccaagac cgagctgtac cacggcgccg gcaagggccg cctctggccg 1800 

ctcctcaccg gcgagcgcgg catgtacgag atcgccgccg gcaaggacgc caccccgtac 1860 

gtgaaggcga tggagaagtt cgccaacgag ggcggcatca tctccgagca ggtgtgggag 1920 

gacaccggcc tcccgaccga ctccgcctcc ccgctcaact gggcccacgc cgagtacgtg 1980 

atcctcttcg cctccaacat cgagcacaag gtgctcgaca tgccggacat cgtgtac 2037 

<210> 49 
<211> 579 
<212> PRT 

<213> Rhizopus oryzae 
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Tl a 
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LVS 

# 


Lys 
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i nr 
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Tin 

lie 


lyr 
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H 1 a 


Ion 
nop 




OCX 


A en 
Mo yj 


Ck q n 


i xp 








35 










4 0 










4 5 








Asn 


Asn 


Gly 


Asn 


Thr 


He 


Ala 


Ala 


Ser 


Tyr 


Ser 


Ala 


Pro 


lie 


Ser 


Gly 




50 










55 










60 










Ser 


Asn 


Tyr 


Glu 


Tyr 


Trp 


Thr 


Phe 


Ser 


Ala 


Ser 


lie 


Asn 


Gly 


He 


Lys 


65 










70 










75 










80 


Glu 


Phe 


Tyr 


He 


Lys 


Tyr 


Glu 


Val 


Ser 


Gly 


Lys 


Thr 


Tyr 


Tyr 


Asp 


Asn 










85 










90 










95 




Asn 


Asn 


Ser 


Ala 


Asn 


Tyr 


Gin 


Val 


Ser 


Thr 


Ser 


Lys 


Pro 


Thr 


Thr 


Thr 








100 










105 










110 






Thr 


Ala 


Thr 


Ala 


Thr 


Thr 


Thr 


Thr 


Ala 


Pro 


Ser 


Thr 


Ser 


Thr 


Thr 


Thr 






115 










120 










125 








Pro 


Pro 


Ser 


Arg 


Ser 


Glu 


Pro 


Ala 


Thr 


Phe 


Pro 


Thr 


Gly 


Asn 


Ser 


Thr 




130 










135 










140 










lie 


Ser 


Ser 


Trp 


He 


Lys 


Lys 


Gin 


Glu 


Gly 


lie 


Ser 


Arg 


Phe 


Ala 


Met 


145 










150 










155 










160 


Leu 


Arg 


Asn 


He 


Asn 


Pro 


Pro 


Gly 


Ser 


Ala 


Thr 


Gly 


Phe 


He 


Ala 


Ala 










165 










170 










175 




Ser 


Leu 


Ser 


Thr 


Ala 


Gly 


Pro 


Asp 


Tyr 


Tyr 


Tyr 


Ala 


Trp 


Thr 


Arg 


Asp 








180 










185 










190 






Ala 


Ala 


Leu 


Thr 


Ser 


Asn 


Val 


He 


Val 


Tyr 


Glu 


Tyr 


Asn 


Thr 


Thr 


Leu 






195 










200 










205 








Ser 


Gly 


Asn 


Lys 


Thr 


He 


Leu 


Asn 


Val 


Leu 


Lys 


Asp 


Tyr 


Val 


Thr 


Phe 




210 










215 










220 










Ser 


Val 


Lys 


Thr 


Gin 


Ser 


Thr 


Ser 


Thr 


Val 


Cys 


Asn 


Cys 


Leu 


Gly 


Glu 


225 










230 










235 










240 


Pro 


Lys 


Phe 


Asn 


Pro 


Asp 


Ala 


Ser 


Gly 


Tyr 


Thr 


Gly 


Ala 


Trp 


Gly 


Arg 










245 










250 










255 




Pro 


Gin 


Asn 


Asp 


Gly 


Pro 


Ala 


Glu 


Arg 


Ala 


Thr 


Thr 


Phe 


lie 


Leu 


Phe 








260 










265 










270 
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Ala 


Asp 


Ser 


Tyr 






275 




Thr 


Leu 


Lvs 


Pro 




290 






Trp 


Ser 


Asn 


Gly 


305 








Phe 


Tvr 


Thr 


Leu 


Phe 


Ala 


Lys 


Arg 








340 


Thr 


Ala 


Ser 


Thr 






3 55 




Asn 


Asn 


Trp 


lie 




370 






Lys 


Gly 


Leu 


Asp 


3 85 








Asp 


Asp 


Gly 


Phe 


Val 


Ala 


Val 


Glu 








420 


Leu 


Pro 


Ser 


Tvr 






435 




Tvr 


Asn 


Gly 


Asn 




450 






Thr 

* « 1 A. 


Gly Tyr 


Al a 


4 65 








Asn 


Gly Gly 


Val 


Phe 


Asp 


Ser 


Ser 








500 




Phe 


Asn 


Sen 
Mb II 






515 




Leu 


Ser 


Thr 


Val 




530 






Glu 


Phe 


Asp 


Arg 


545 








Trp 


Ser 


His 


Ala 


Pro 


Ala 


Ala 





Leu Thr Gin Thr 

280 

Ala lie Phe Lys 
295 

Cys Phe Asp Leu 
310 

Met Val Met Arg 
325 

Asn Gly Asp Ser 

lie Ala Asn Lys 

360 

Gin Val Ser Gin 
375 

Val Ser Thr Leu 
390 

Phe Thr Pro Gly 
405 

Asp Ser Phe Ala 

Leu Gly Asn Ser 

440 

Gly Asn Ser Gin 
455 

Glu Leu Tyr Tyr 
470 

Thr Val Ser Ser 
485 

Ala Thr Ser Gly 

Leu Ala Gin Asn 

520 

Gin Leu His Ala 
535 

Thr Thr Gly Leu 
550 

Ser Leu lie Thr 
565 



Lys Asp Ala Ser 

Asp Leu Asp Tyr 

300 

Trp Glu Glu Val 
315 

Lys Gly Leu Leu 
330 

Thr Arg Ala Ser 
345 

He Ser Ser Phe 

Ser Val Thr Gly 

380 

Leu Ala Ala Asn 
395 

Ser Glu Lys He 
410 

Ser Leu Tyr Pro 
425 

lie Gly Arg Tyr 

Gly Asn Ser Trp 

460 

Arg Ala He Lys 
475 

He Ser Leu Pro 
490 

Lys Lys Tyr Thr 
505 

He Ala Leu Ala 

His Asn Asn Gly 

540 

Ser Thr Gly Ala 
555 

Ala Ser Tyr Ala 
570 



Tyr 


Val 


Thr 


Gly 

4 


285 








Val 


Val 


Asn 


Val 


Asn 


Gly 


Val 


His 








320 


Leu 


Gly 


Ala 


Asp 






335 




Thr 


Tyr 


Ser 


Ser 




350 






Trp 

tr 


Val 


Ser 


Ser 


365 








Gly 


Val 


Ser 


Lys 


Leu 


Gly 


Ser 


Val 








400 


Leu 


Ala 


Thr 


Ala 






415 




He 


Asn 


Lys 


Asn 




430 






Pro 


Glu 


Asp 


Thr 


445 








Phe 


Leu 


Ala 


Val 


Glu 


Trp 


He 


Gly 








480 


Phe 


Phe 


Lys 


Lys 






495 




Val 


Gly 


Thr 


Ser 




510 






Ala 


Asp 


Arg 


Phe 


525 








Ser 


Leu 


Ala 


Glu 


Arg 


Asp 


Leu 


Thr 








560 


Lys 


Ala 


Gly 


Ala 






575 





<210> 50 
<211> 1737 
<212> DNA 

<213> Rhizopus oryzae 

<400> 50 

gcctccatcc cgtcctccgc ctccgtgcag ctcgactcct acaactacga cggctccacc 60 

ttctccggca aaatctacgt gaagaacatc gcctactcca agaaggtgac cgtgatctac 120 

gccgacggct ccgacaactg gaacaacaac ggcaacacca tcgccgcctc ctactccgcc 180 

ccgatctccg gctccaacta cgagtactgg accttctccg cctccatcaa eggcatcaag 240 

gagttctaca tcaagtacga ggtgtccggc aagacctact acgacaacaa caactccgcc 300 

aactaccagg tgtccacctc caagccgacc accaccaccg ccaccgccac caccaccacc 360 
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gccccgtcca cctccaccac caccccgccg tcccgctccg agccggccac cttcccgacc 420 

ggcaactcca ccatctcctc ctggatcaag aagcaggagg gcatctcccg cttcgccatg 480 

ctccgcaaca tcaacccgcc gggctccgcc accggcttca tcgccgcctc cctctccacc 540 

gccggcccgg actactacta cgcctggacc cgcgacgccg ccctcacctc caacgtgatc 600 

gtgtacgagc acaacaccac cctctccggc aacaagacca tcctcaacgt gctcaaggac 660 

tacgtgacct tctccgtgaa gacccagtcc acctccaccg tgtgcaactg cctcggcgag 720 

ccgaagttca acccggacgc ctccggctac accggcgcct ggggccgccc gcagaacgac 780 

ggcccggccg agcgcgccac caccttcatc ctcttcgccg actcctacct cacccagacc 840 

aaggacgcct cctacgtgac cggcaccctc aagccggcca tcttcaagga cctcgactac 900 

gtggtgaacg tgtggtccaa cggctgcttc gacctctggg aggaggtgaa cggcgtgcac 960 

ttctacaccc tcatggtgat gcgcaagggc ctcctcctcg gcgccgactt cgccaagcgc 1020 

aacggcgact ccacccgcgc ctccacctac tcctccaccg cctccaccat cgccaacaaa 1080 

atctcctcct tctgggtgtc ctccaacaac tggatacagg tgtcccagtc cgtgaccggc 1140 

ggcgtgtcca agaagggcct cgacgtgtcc accctcctcg ccgccaacct cggctccgtg 1200 

gacgacggct tcttcacccc gggctccgag aagatcctcg ccaccgccgt ggccgtggag 1260 

gactccttcg cctccctcta cccgatcaac aagaacctcc cgtcctacct cggcaactcc 1320 

atcggccgct acccggagga cacctacaac ggcaacggca actcccaggg caactcctgg 1380 

ttcctcgccg tgaccggcta cgccgagctg tactaccgcg ccatcaagga gtggatcggc 1440 

aacggcggcg tgaccgtgtc ctccatctcc ctcccgttct tcaagaagtt cgactcctcc 1500 

gccacctccg gcaagaagta caccgtgggc acctccgact tcaacaacct cgcccagaac 1560 

atcgccctcg ccgccgaccg cttcctctcc accgtgcagc tccacgccca caacaacggc 1620 

tccctcgccg aggagttcga ccgcaccacc ggcctctcca ccggcgcccg cgacctcacc 1680 

tggtcccacg cctccctcat caccgcctcc tacgccaagg ccggcgcccc ggccgcc 1737 

<210> 51 
<211> 439 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 51 

Met Ala Lys His Leu Ala Ala Met Cys Trp Cys Ser Leu Leu Val Leu 

15 10 15 

Val Leu Leu Cys Leu Gly Ser Gin Leu Ala Gin Ser Gin Val Leu Phe 

20 25 30 

Gin Gly Phe Asn Trp Glu Ser Trp Lys Lys Gin Gly Gly Trp Tyr Asn 

35 40 45 

Tyr Leu Leu Gly Arg Val Asp Asp He Ala Ala Thr Gly Ala Thr His 

50 55 60 

Val Trp Leu Pro Gin Pro Ser His Ser Val Ala Pro Gin Gly Tyr Met 
65 70 75 80 

Pro Gly Arg Leu Tyr Asp Leu Asp Ala Ser Lys Tyr Gly Thr His Ala 

85 90 95 

Glu Leu Lys Ser Leu Thr Ala Ala Phe His Ala Lys Gly Val Gin Cys 

100 105 110 

Val Ala Asp Val Val He Asn His Arg Cys Ala Asp Tyr Lys Asp Gly 

115 120 125 

Arg Gly He Tyr Cys Val Phe Glu Gly Gly Thr Pro Asp Ser Arg Leu 

130 135 140 

Asp Trp Gly Pro Asp Met He Cys Ser Asp Asp Thr Gin Tyr Ser Asn 
145 150 155 160 

Gly Arg Gly His Arg Asp Thr Gly Ala Asp Phe Ala Ala Ala Pro Asp 
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165 



lie 


Asp 


His 


Leu 


Asn 


Pro 


Arg 


Val 








180 










Asn 


Trp 


Leu 


Lys 


Ser 


Asp 


Leu 


Gly 






195 










200 


Ala 


Lys 


Gly 


Tyr 


Ser 


Ala 


Ala 


Val 




210 










215 




Ala 


Pro 


Thr 


Phe 


Val 


Val 


Ala 


Glu 


225 










230 






Gly Asn 


Gly 


Glu 


Pro 


Ser 


Ser 


Asn 










245 








Val 


Asn 


Tro 


Ala 


Gin 


Ala 


Val 


Gly 








260 










Thr 


Thr 


Lvs 


Gly 


Val 


Leu 


Gin 


Ala 






275 










280 


Met 


Lys 


Asp 


Gly 


Asn 


Gly 


Lys 


Ala 




290 










295 




Glu 


Lys 


Ala 


Val 


Thr 


Phe 


Val 


Asp 


305 










310 






Asn 


Ser 


Tro 


Pro 


Phe 


Pro 


Ser 


Asp 










325 








lie 


Leu 


Thr 


His 


Pro 


Gly 


Thr 


Pro 








340 










Asp 


Trp 


Asn 


Leu 


Lvs 


Gin 


Glu 


He 






355 










360 


Arg 


Asn 


Gly 


lie 


His 


Pro 


Gly 


Ser 




370 










375 




Gly 


Asp 


Leu 


Tyr 


Val 


Ala 


Lys 


He 


385 










390 






Gly 


Ser 


Arg 


Tyr 


Asp 


Val 


Gly 


Asn 










405 








Val 


Ala 


His 


Gly 


Asn 


Asn 


Tyr 


Cys 








420 










Val 


Pro 


Ala 


Gly 


Arg 


His 


His 







170 










175 




Gin 


Gin 


Glu 


Leu 


Ser 


Asp 


Trp 


Leu 


185 










190 






Phe 


Asp 


Gly 


Trp 


Arg 


Leu 


Asp 


Phe 










205 








Ala 


Lys 


Val 


Tyr 


Val 


Asp 


Ser 


Thr 








220 










lie 


Trp 

* 


Ser 


Ser 


Leu 


His 


Tyr 


Asp 






235 










240 


Gin 


Asp 


Ala 


Asp 


Arg 


Gin 


Glu 


Leu 




250 










255 




Gly 


Pro 


Ala 


Ala 


Ala 


Phe 


Asp 


Phe 


265 










270 






Ala 


Val 


Gin 


Gly 


Glu 


Leu 


Trp 


Arg 










285 








Pro 


Gly 


Met 


lie 


Gly 


Trp 


Leu 


Pro 








300 










Asn 


His 


Asp 


Thr 


Gly 


Ser 


Thr 


Gin 






315 










320 


Lys 


Val 


Met 


Gin 


Gly 


Tyr 


Ala 


Tyr 




330 










335 




Cys 


He 


Phe 


Tyr 


Asp 


His 


Val 


Phe 


345 










350 






Ser 


Ala 


Leu 


Ser 


Ala 


Val 


Arg 


Ser 










365 








Glu 


Leu 


Asn 


He 


Leu 


Ala 


Ala 


Asp 








380 










Asp 


Asp 


Lys 


Val 


He 


Val 


Lys 


He 






395 










400 


Leu 


He 


Pro 


Ser 


Asp 


Phe 


His 


Ala 




410 










415 




Val 


Trp 


Glu 


Lys 


His 


Gly 


Leu 


Arg 


425 










430 







<210> 52 
<211> 1320 
<212> DNA 

<213> Artificial Sequence 
<220> 

< 2 2 3 > synthetic 
<400> 52 

atggcgaagc acttggctgc catgtgctgg tgcagcctcc tagtgcttgt actgctctgc 60 
ttgggctccc agctggccca atcccaggtc ctcttccagg ggttcaactg ggagtcgtgg 120 
aagaagcaag gtgggtggta caactacctc ctggggcggg tggacgacat cgccgcgacg 180 
ggggccacgc acgtctggct cccgcagccg tcgcactcgg tggcgccgca ggggtacatg 240 
cccggccggc tctacgacct ggacgcgtcc aagtacggca cccacgcgga gctcaagtcg 300 
ctcaccgcgg cgttccacgc caagggcgtc cagtgcgtcg ccgacgtcgt gatcaaccac 360 
cgctgcgccg actacaagga cggccgcggc atctactgcg tcttcgaggg cggcacgccc 420 
gacagccgcc tcgactgggg ccccgacatg atctgcagcg acgacacgca gtactccaac 4 80 
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gggcgcgggc accgcgacac gggggccgac ttcgccgccg cgcccgacat cgaccacctc 540 
aacccgcgcg tgcagcagga gctctcggac tggctcaact ggctcaagtc cgacctcggc 600 
ttcgacggct ggcgcctcga cttcgccaag ggctactccg ccgccgtcgc caaggtgtac 660 
gtcgacagca ccgcccccac cttcgtcgtc gccgagatat ggagctccct ccactacgac 720 
ggcaacggcg agccgtccag caaccaggac gccgacaggc aggagctggt caactgggcg 780 
caggcggtgg gcggccccgc cgcggcgttc gacttcacca ccaagggcgt gctgcaggcg 840 
gccgtccagg gcgagctgtg gcgcatgaag gacggcaacg gcaaggcgcc cgggatgatc 900 
ggctggctgc cggagaaggc cgtcacgttc gtcgacaacc acgacaccgg ctccacgcag 960 
aactcgtggc cattcccctc cgacaaggtc atgcagggct acgcctatat cctcacgcac 1020 
ccaggaactc catgcatctt ctacgaccac gttttcgact ggaacctgaa gcaggagatc 1080 
agcgcgctgt ctgcggtgag gtcaagaaac gggatccacc cggggagcga gctgaacatc 1140 
ctcgccgccg acggggatct ctacgtcgcc aagattgacg acaaggtcat cgtgaagatc 1200 
gggtcacggt acgacgtcgg gaacctgatc ccctcagact tccacgccgt tgcccctggc 1260 
aacaactact gcgtttggga gaagcacggt ctgagagttc cagcggggcg gcaccactag 1320 



<2io> 53 
<211> 45 
<2\2> PRT 

< 2 1 3 > Artificial Sequence 
<220> 

<223> synthetic 
<400> 53 

Ala Thr Gly Gly Thr Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly 

15 10 15 

Val Thr Ser Thr Ser Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr 

20 25 30 

Thr Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr Ala Val 
35 40 45 



<210> 54 
<211> 137 
<212> DNA 

<213> Artificial Sequence 
<220> 

c223> synthetic 
<400> 54 

gccaccggcg gcaccaccac caccgccacc accaccggct ccggcggcgt gacctccacc 60 
cccaagacca ccaccaccgc ctccaagacc tccaccacca cctcctccac ctcctgcacc 120 
accccgaccg ccgtgtc 137 

<210> 55 
<211> 300 
<212> PRT 

<213> Pyrococcus furiosus 
c400> 55 

He Tyr Phe Val Glu Lys Tyr His Thr Ser Glu Asp Lys Ser Thr Ser 

1 5 10 15 

Asn Thr Ser Ser Thr Pro Pro Gin Thr Thr Leu Ser Thr Thr Lys Val 
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20 25 30 



Leu 


Lys 


lie 


Arg 


Tyr 


Pro 


Asp 


Asp 


Gly 


Glu 


Trp 


Pro 


Gly 


Ala 


Pro 


lie 






35 










40 










45 








Asp 


Lys 


Asp 


Gly 


Asp 


Gly 


Asn 


Pro 


Glu 


Phe 


Tyr 


He 


Glu 


He 


Asn 


Leu 




50 










55 










60 










Trp 


Asn 


lie 


Leu 


Asn 


Ala 


Thr 


Gly 


Phe 


Ala 


Glu 


Met 


Thr 


Tyr 


Asn 


Leu 


65 










70 










75 










80 


Thr 


Ser 


Gly 


Val 


Leu 


His 


Tyr 


val 


Gin 


Gin 


Leu 


Asp 


Asn 


lie 


val 


Leu 










85 










90 










95 




Arq 


Asp 


Arq 


Ser 


Asn 


Trp 


Val 


His 


Gly 


Tyr 


Pro 


Glu 


He 


Phe 


Tyr 


Gly 








100 










105 










110 






Asn 


Lys 


Pro 


Trp 


Asn 


Ala 


Asn 


Tyr 


Ala 


Thr 


Asp 


Gly 


Pro 


lie 


Pro 


Leu 






115 










120 










125 








Pro 


Ser 


Lys 


Val 


Ser 


Asn 


Leu 


Thr 


Asp 


Phe 


Tyr 


Leu 


Thr 


lie 


Ser 


Tyr 




130 










135 










140 










Lys 


Leu 


Glu 


Pro 


Lys 

4 


Asn 


Gly 


Leu 


Pro 


He 


Asn 


Phe 


Ala 


lie 


Glu 


Ser 


145 










150 










155 










160 


Trp 


Leu 


Thr 


Arg 


Glu 


Ala 


Trp 


Arg 


Thr 


Thr 


Gly 


He 


Asn 


Ser 


Asp 


Glu 










165 










170 










175 




Gin 


Glu 


Val 


Met 


He 


Trp 


He 


Tyr 


Tyr 


Asp 


Gly 


Leu 


Gin 


Pro 


Ala 


Gly 








180 










185 










190 






Ser 


Lys 


Val 


Lys 
* 


Glu 


He 


Val 


Val 


Pro 


He 


lie 


Val 


Asn 


Gly 


Thr 


Pro 






195 










200 










205 








Val 


Asn 


Ala 


Thr 


Phe 


Glu 


Val 


Trp 


Lys 


Ala 


Asn 


He 


Gly 


Trp 


Glu 


Tyr 




210 










215 










220 










Val 


Ala 


Phe 


Arg 


He 


Lys 


Thr 


Pro 


He 


Lys 


Glu 


Gly 


Thr 


Val 


Thr 


He 


225 










230 










235 










240 


Pro 


Tyr 


Gly 


Ala 


Phe 


He 


Ser 


Val 


Ala 


Ala 


Asn 


lie 


Ser 


Ser 


Leu 


Pro 






245 










250 










255 




Asn 


Tyr 


Thr 


Glu 


Leu 


Tyr 


Leu 


Glu 


ASp 


Val 


Glu 


lie 


Gly 


Thr 


Glu 


Phe 






260 










265 










270 






Gly Thr 


Pro 


Ser 


Thr 


Thr 


Ser 


Ala 


His 


Leu 


Glu 


Trp 


Trp 


lie 


Thr 


Asn 






275 










280 










285 








lie 


Thr 


Leu 


Thr 


Pro 


Leu 


Asp 


Arg 


Pro 


Leu 


He 


Ser 












290 










295 










300 











<210> 56 
<211> 903 
c212> DNA 

<213> Pyrococcus furiosus 
<400> 56 

atctacttcg tggagaagta ccacacctcc gaggacaagt ccacctccaa cacctcctcc 60 
accccgccgc agaccaccct ctccaccacc aaggtgctca agatccgcta cccggacgac 120 
ggcgagtggc ccggcgcccc gatcgacaag gacggcgacg gcaacccgga gttctacatc 180 
gagatcaacc tctggaacat cctcaacgcc accggcttcg ccgagatgac ctacaacctc 240 
actagtggcg tgctccacta cgtgcagcag ctcgacaaca tcgtgctccg cgaccgctcc 300 
aactgggtgc acggctaccc ggaaatcttc tacggcaaca agccgtggaa cgccaactac 360 
gccaccgacg gcccgatccc gctcccgtcc aaggtgtcca acctcaccga cttctacctc 420 
accatctcct acaagctcga gccgaagaac ggtctcccga tcaacttcgc catcgagtcc 480 
tggctcaccc gcgaggcctg gcgcaccacc ggcatcaact ccgacgagca ggaggtgatg 54 0 
atctggatct actacgacgg cctccagccc gcgggctcca aggtgaagga gatcgtggtg 600 
ccgatcatcg tgaacggcac cccggtgaac gccaccttcg aggtgtggaa ggccaacatc 660 
ggctgggagt acgtggcctt ccgcatcaag accccgatca aggagggcac cgtgaccatc 720 

228 



t 



WO 2005/096804 



PCT/US2004/007182 



ccgtacggcg ccttcatctc 
aagtacctcg aggacgtgga 
cacctcgagt ggtggatcac 
tag 



cgtggccgcc aacatctcct 
gatcggcacc gagttcggca 
caacatcacc ctcaccccgc 



ccctcccgaa ctacaccgag 780 
ccccgtccac cacctccgcc 840 
tcgaccgccc gctcatctcc 900 

903 



<210> 57 

<2U> 387 

<212> PRT 

<213> Thermus flavus 



<400> 57 
Met Tyr Glu Pro 
1 

Val Asp Asn Val 

20 

Leu Asp Pro Val 
35 

Gly Val Asn Leu 
50 

Gin Glu Arg Asp 
65 

Thr Val Leu Lys 

Ala Phe Arg Asp 

100 

Tyr Ala Leu Arg 
115 

Gly Ala Glu He 
130 

Glu Ser Thr Asp 
145 

Leu Asn Phe Met 

Phe Ser Val Glu 

180 

Thr Thr Val Gly 
195 

Glu Arg Phe Gly 
210 

Leu Asn Phe Asp 

225 

Phe His He Asp 

Leu Arg Phe Gly 

260 

Leu Leu Glu Ser 
275 

Ala Leu Arg Thr 
290 

Cys Met Arg Thr 
305 

Glu Asp Pro Glu 
Pro Ala Thr Leu 



Lys Pro Glu His 

5 

Asp Arg Asp Pro 

Tyr Val Val His 

40 

His Asp Glu Asp 
55 

Gin He Val Arg 
70 

Val Pro Met Val 
85 

Gly Ala Ser Thr 

Lys Ser Leu Glu 

120 

Tyr Met Phe Trp 
135 

Lys Thr Arg Lys 
150 

Thr Ala Tyr Thr 
165 

Pro Lys Pro Asn 

Ser Met Leu Ala 

200 

Leu Asn Pro Glu 
215 

His Ala Val Ala 
230 

Leu Asn Asp Gin 
245 

Ser Glu Asn Leu 

Ser Gly Tyr Gin 

280 

Glu Asp Glu Glu 
295 

Tyr Leu He He 
310 

Val Lys Glu Leu 
325 

Ala Leu Leu Asp 



Arg 


pne 


rnr 


O V-i a 

pne 




i n 






pne 




ASp 


i nr 










Lys 


beu 


Ala 


pin 

bill 


Leu 


He 


Pro 


Arg 








b U 


Arg 


pne 


Lys 


Lys 






"7 R 




Thr 


Aia 


A r—k 

Asn 


beu 




on 






Thr 


Arg 


ASp 


T""\ w* 

pro 


1 AC 

1 \) D 








i nr 


rie c 


T\. 

ASp 




Met 


Val 


Arg 


Glu 








14 U 


va i 


Trp 


ASp 


irp 






1 ^ 




pi„ 


ASp 


win 






1 10 

X f V 






Pin 


D y /*> 


ft v"/-» 




10 j 








Leu 


He 


His 


Thr 


Phe 


Ala 


His 


Glu 








220 


Gin 


Ala 


Val 


Asp 






235 




Arg 


Met 


Ser 


Arg 




250 






Lys 


Ala 


Gly 


Phe 


265 








Gly 


Pro 


Arg 


His 


Gly 


val 


Trp 


Thr 








300 


Lys 


Val 


Arg 


Ala 






315 




Leu 


Ala 


Ala 


Tyr 




330 






Pro 


Tyr 


Ser 


Arg 



Hi \r 
KJXy 


T All 

Lieu 


irp 


T'Vi V* 

i n i: 






X -J 




val 


Arg 


H, 1 ii 
UlU 


TV v-/—i 

Airg 




1 0 






T on 


pi v 

vj j. y 


aia 


iyr 


*i □ 








Hi \r 

vj ly 


l nr 


D v* <~v 


fro 


Ala 


Leu 


Asp 


Glu 










file 


O a v* 

be r 


p 1 ii 


D v~ f\ 

rro 






QC. 




Trp 


Wa 1 

v a i 


irp 


aia 
Ala 




1 IV 






\jiy 


Ala 

Aia 


UlU 


Lieu 


it J 








Arg 


Coy 

ber 


H,1 n 
UlU 


\7a 1 
Val 


Val 


Arg 


Glu 


Thr 








X \> V 


iyr 


ox y 


iyr 








175 

X t *J 






Tie 

IXC 




rue 




190 

£p it v 






Leu 


Asp 


Arg 


Pro 


205 








Thr 


Met 


Ala 


Gly 


Ala 


Gly 


Lys 


Leu 








240 


Phe 


Asp 


Gin 


Asp 






255 




Phe 


Leu 


Val 


Asp 




270 






Phe 


Glu 


Ala 


His 


285 








Phe 


Val 


Arg 


Val 


Glu 


Thr 


Phe 


Arg 








320 


Tyr 


Gin 


Glu 


Asp 






335 




Glu 


Lys 


Ala 


Glu 
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340 

Ala Leu Lys Arg Ala Glu Leu Pro 
355 360 
Tyr Ala Leu Glu Arg Leu Asp Gin 

370 375 
Val Arg Gly 
385 



345 350 
Leu Glu Thr Lys Arg Arg Arg Gly 

365 

Leu Ala Val Glu Tyr Leu Leu Gly 

380 



<210> 58 
<211> 978 
<2\2> DMA 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 58 

atggggaaga acggcaacct gtgctgcttc tctctgctgc tgcttcttct cgccgggttg 60 

gcgtccggcc atcaaatcta cttcgtggag aagtaccaca cctccgagga caagtccacc 120 

tccaacacct cctccacccc gccgcagacc accctctcca ccaccaaggt gctcaagatc 180 

cgctacccgg acgacggtga gtggcccggc gccccgatcg acaaggacgg cgacggcaac 240 

ccggagttct acatcgagat caacctctgg aacatcctca acgccaccgg cttcgccgag 300 

atgacctaca acctcactag tggcgtgctc cactacgtgc agcagctcga caacatcgtg 360 

ctccgcgacc gctccaactg ggtgcacggc tacccggaaa tcttctacgg caacaagccg 420 

tggaacgcca actacgccac cgacggcccg atcccgctcc cgtccaaggt gtccaacctc 480 

accgacttct acctcaccat ctcctacaag ctcgagccga agaacggtct cccgatcaac 540 

ctcgccatcg agtcctggct cacccgcgag gcctggcgca ccaccggcat caactccgac 600 

gagcaggagg tgatgatctg gatctactac gacggcctcc agcccgcggg ctccaaggtg 660 

aaggagatcg tggtgccgat catcgtgaac ggcaccccgg tgaacgccac cttcgaggtg 720 

tggaaggcca acatcggctg ggagtacgtg gccttccgca tcaagacccc gatcaaggag 780 

ggcaccgtga ccatcccgta cggcgccttc atctccgtgg ccgccaacat ctcctccctc 840 

ccgaactaca ccgagaagta cctcgaggac gtggagatcg gcaccgagtt cggcaccccg 900 

tccaccacct ccgcccacct cgagtggtgg atcaccaaca tcaccctcac cccgctcgac 960 

cgcccgctca tctcctag 978 



<210> 59 
<211> 1920 
<212> DNA 

<213> Aspergillus niger 



<400> 59 

atgtccttcc gctccctcct cgccctctcc 
atctccaagc gcgccaccct cgactcctgg 
gccatcctca acaacatcgg cgccgacggc 
gtggtggcct ccccgtccac cgacaacccg 
ggcctcgtgc tcaagaccct cgtggacctc 
accatcgaga actacatctc cgcccaggcc 
gacctctcct ccggcgccgg cctcggcgag 



ggcctcgtgt gcaccggcct cgccaacgtg 60 

ctctccaacg aggccaccgt ggcccgcacc 120 

gcctgggtgt ccggcgccga ctccggcatc 180 

gactacttct acacctggac ccgcgactcc 240 

ttccgcaacg gcgacacctc cctcctctcc 300 

atcgtgcagg gcatctccaa cccgtccggc 360 

ccgaagttca acgtggacga gaccgcctac 420 
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accggctcct ggggccgccc 

ggcttcggcc agtggctcct 

ccgctcgtgc gcaacgacct 

ctctgggagg aggtgaacgg 

5 gtggagggct ccgccttcgc 

gccccggaga tcctctgcta 

ttcgactcct cccgctccgg 

gacccggagg ccgcctgcga 

aaccacaagg aggtggtgga 

10 gactccgagg ccgtggccgt 

tggttcctct gcaccctcgc 

aagcagggct ccctcgaggt 

gacgccgcca ccggcaccta 

gtgaagacct tcgccgacgg 

15 tccatgtccg agcagtacga 

tggtcctacg ccgccctcct 

tggggcgaga cctccgcctc 

acctactcct ccgtgaccgt 

accaccgcca ccccgaccgg 

20 gcctccaaga cctccacctc 

gtgaccttcg acctcaccgc 

atctcccagc tcggcgactg 

acctcctccg acccgctctg 

tacaagttca tccgcatcga 

25 gagtacaccg tgccgcaggc 



gcagcgcgac ggcccggccc 
cgacaacggc tacacctcca 
ctcctacgtg gcccagtact 
ctcctccttc ttcaccatcg 
caccgccgtg ggctcctcct 
cctccagtcc ttctggaccg 
caaggacgcc aacaccctcc 
cgactccacc ttccagccgt 
ctccttccgc tccatctaca 
gggccgctac ccggaggaca 
cgccgccgag cagctctacg 
gaccgacgtg tccctcgact 
ctcctcctcc tcctccacct 
cttcgtgtcc atcgtggaga 
caagtccgac ggcgagcagc 
caccgccaac aaccgccgca 
ctccgtgccg ggcacctgcg 
gacctcctgg ccgtccatcg 
ctccggcccc gcgacctcca 
cacctcctcc acctcctgca 
caccaccacc tacggcgaga 
ggagacctcc gacggcatcg 
gtacgtgacc gtgaccctcc 
gtccgacgac cccgtggagt 
ctgcggcacc tccaccgcca 



cccgcgccac cgccatgatc 4 80 
ccgccaccga catcgtgtgg 540 
ggaaccagac cggctacgac 600 
ccgtgcagca ccgcgccctc 660 
gctcctggtg cgactcccag 720 
gctccttcat cctcgccaac 780 
tcggctccat ccacaccttc 840 
gctccccgcg cgccctcgcc 900 
ccctcaacga cggcctctcc 960 
cctactacaa cggcaacccg 1020 
acgccctcta ccagtgggac 1080 
tcttcaaggc cctctactcc 1140 
actcctccat cgtggacgcc 1200 
cccacgccgc ctccaacggc 1260 
tctccgcccg cgacctcacc 1320 
actccgtggt gccggcctcc 1380 
ccgccacctc cgccatcggc 1440 
tggccaccgg cggcaccacc 1500 
cctccaagac caccgccacc 1560 
ccaccccgac cgccgtggcc 1620 
acatctacct cgtgggctcc 1680 
ccctctccgc cgacaagtac 1740 
cggccggcga gtccttcgag 1800 
gggagtccga cccgaaccgc 1860 
ccgtgaccga cacctggcgc 1920 



<210> 60 

<211> 6 

30 <212> PRT 

<213> Artificial Sequence 



35 



<220> 

<223> synthetic 
<400> 60 

Ser Glu Lys Asp Glu Leu 



40 

<210> 61 

<211> 561 

<212> DNA 

^15 <213> Artificial Sequence 

<220> 

<223> Xylanase BD7436 

50 

<220> 

<221> CDS 

<222> (1) . . (561) 

55 <4O0> 61 

atg get age acc ttc tac tgg cat ttg tgg acc gac ggc ate ggc acc 48 
Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly lie Gly Thr 
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10 



15 10 15 

gtg aac get acc aac ggc age gac ggc aac tac age gtg age tgg age 96 
Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 
5 20 25 30 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc age get 144 
Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

acc agg gtg ate aac tac aac get cat get ttc age gtg gtg ggc aac 192 
Thr Arg Val lie Asn Tyr Asn Ala His Ala Phe Ser Val Val Gly Asn 
50 55 60 

15 get tac ttg get ttg tac ggc tgg acc agg aac age ttg ate gag tac 240 
Ala Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 
65 70 75 80 

tac gtg gtg gac age tgg ggc acc tac agg cca acc ggc acc tac aag 288 
20 Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

ggc acc gtg acc age gac ggc ggc acc tac gac ate tac acc acc acc 336 
Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 
25 100 105 HO 



30 



agg acc aac get cca age ate gac ggc aac aac acc acc ttc acc caa 
Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 



384 



ttc tgg age gtg agg caa age aag agg cca ate ggc acc aac aac acc 432 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 

130 135 140 

35 ate acc ttc age aac cat gtg aac get tgg aag age aag ggc atg aac 480 

lie Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 

145 150 155 160 

ttg ggc age age tgg age tac caa gtg ttg get acc gag ggc tac caa 528 

40 Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

age age ggc tac age aac gtg acc gtg tgg tag 561 

Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 
45 180 185 



<210> 62 

<211> 186 

50 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Synthetic Construct 



55 



c400> 62 
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Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly lie Gly Thr 
1 5 10 15 



5 val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 



Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
10 35 40 45 



Thr Arg Val lie Asn Tyr Asn Ala His Ala Phe Ser Val Val Gly Asn 
50 55 60 

15 

Ala Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 75 80 

20 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 



25 Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 HO 



Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
30 115 120 125 



Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

35 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

40 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 



45 Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



<210> 63 

50 <211> 561 

<212> DNA 

<213> Artificial Sequence 
<220> 

55 <223> Xylanase BD6002A 



233 
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<220> 

<221> CDS 

<222> (1) . . (561) 

5 <400> 63 

atg get age acc gac tac tgg caa aac tgg acc gac ggc ggc ggc acc 48 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 

10 gtg aac get acc aac ggc age gac ggc aac tac age gtg age tgg age 96 
Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc age get 144 
15 Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 

35 40 45 

acc agg gtg ate aac tac aac get ggc get ttc age cca age ggc aac 192 
Thr Arg Val He Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
20 50 55 60 



25 



ggc tac ttg get ttg tac ggc tgg acc agg aac age ttg ate gag tac 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 

65 70 75 80 

tac gtg gtg gac age tgg ggc acc tac agg cca acc ggc acc tac aag 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 



240 



288 



30 ggc acc gtg acc age gac ggc ggc acc tac gac ate tac acc acc acc 336 
Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 110 

agg acc aac get cca age ate gac ggc aac aac acc acc ttc acc caa 384 
35 Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 

115 120 125 

ttc tgg age gtg agg caa age aag agg cca ate ggc acc aac aac acc 432 
Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
40 130 135 140 

ate acc ttc age aac cat gtg aac get tgg aag age aag ggc atg aac 480 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

45 

ttg ggc age age tgg age tac caa gtg ttg get acc gag ggc tac caa 528 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

50 age age ggc tac age aac gtg acc gtg tgg tag 561 
Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



55 <210> 64 
<211> 186 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

5 

<400> 64 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 

10 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

15 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 



20 Thr Arg Val He Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 



Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 
25 65 70 75 80 



Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

30 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 HO 

35 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 



40 Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 



He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
45 145 150 155 160 



Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

50 

Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 

55 

<210> 65 
<211> 561 
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<212> DMA 

<213> Artificial Sequence 
<220> 

5 <223> Xylanase BD6002B 



<220> 

<221> CDS 



10 <222> (1) . . (561) 

<400> 65 

atg gcc tec acc gac tac tgg cag aac egg acc gac ggc ggc ggc acc 48 
Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 1 5 10 15 

gtg aac gcc acc aac ggc tec gac ggc aac tac tec gtg tec tgg tec 96 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

20 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc tec gcc 144 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

25 acc cgc gtg ate aac tac aac gcc ggc gcc ttc tec ccg tec ggc aac 192 
Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

ggc tac etc gcc etc tac ggc tgg acc cgc aac tec etc ate gag tac 240 
30 Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr • 
65 70 75 80 

tac gtg gtg gac tec tgg ggc acc tac cgc ccg acc ggc acc tac aag 288 
Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 
35 85 90 95 

ggc acc gtg acc tec gac ggc ggc acc tac gac ate tac acc acc acc 336 
Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 

40 

cgc acc aac gcc ccg tec ate gac ggc aac aac acc acc ttc acc cag 384 
Arg Thr Asn Ala Pro Ser lie Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

45 ttc tgg tec gtg cgc cag tec aag cgc ccg ate ggc acc aac aac acc 432 
Phe Trp Ser Val Arg Gin Ser Lys Arg Pro lie Gly Thr Asn Asn Thr 
130 135 140 

ate acc ttc tec aac cac gtg aac gcc tgg aag tec aag ggc atg aac 480 
50 lie Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

etc ggc tec tec tgg tec tac cag gtg etc gcc acc gag ggc tac cag 528 
Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 
55 165 170 175 

tec tec ggc tac tec aac gtg acc gtg tgg tga 561 
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Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



5 <210> 66 

<211> 186 

<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> Synthetic Construct 

<400> 66 

15 Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
1 5 10 15 



Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 
20 20 25 30 



Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

25 

Thr Arg Val He Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

30 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 
65 70 75 80 



35 Tvr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

8 5 90 95 



Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 
40 100 105 HO 



Arg Thr Asn Ala Pro Ser lie Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

45 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

50 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 



55 Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 
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Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 



<210> 67 

<21l> 2071 

<212> DNA 

< 2 1 3 > Oryza sativa 

<220> 

<221> misc_f eature 
<222> (1) . . (2071) 
<223> Promoter 

<400> 67 

tccatgctgt cctactactt gcttcatccc cttctacatt ttgttctggt ttttggcctg 60 

catittcggat catgatgtat gtgatttcca atctgctgca atatgaatgg agactctgtg 120 

ctaaccatca acaacatgaa atgcttatga ggcctttgct gagcagccaa tcttgcctgt 180 

gtttatgtct tcacaggccg aattcctctg ttttgttttt caccctcaat atttggaaac 240 

acttatctag gttgtttgtg tccaggccta taaatcatac atgatgttgt cgtattggat 300 

gtgaatgtgg tggcgtgttc agtgccttgg atttgagttt gatgagagtt gcttctgggt 360 

caccactcac cattatcgat gctcctcttc agcataaggt aaaagtcttc cctgtttacg 420 

ttattttacc cactatggtt gcttgggttg gttttttcct gattgcttat gccatggaaa 480 

gtcatttgat atgttgaact tgaattaact gtagaattgt atacatgttc catttgtgtt 540 

gtacttcctt cttttctatt agtagcctca gatgagtgtg aaaaaaacag attatataac 600 

ttgccccata aatcatttga aaaaaatatt gtacagtgag aaattgatat atagtgaatt 660 

tttaagagca tgttttccta aagaagtata tattttctat gtacaaaggc cattgaagta 720 

attgtagata caggataatg tagacttttt ggacttacac tgctaccttt aagtaacaat 780 

catgagcaat agtgttgcaa tgatatttag gctgcattcg tttactctct tgatttccat 840 

gagcacgctt cccaaaccgt taaactctgt gttttttgcc aaaaaaaaat gcataggaaa 900 

gttgctttta aaaaatcata tcaatccatt ttttaagtta tagctaatac ttaattaatc 960 

atgcgctaat aagtcactct gtttttcgta ctagagagat tgttttgaac cagcactcaa 1020 

gaacacagcc ttaacccagc caaataatgc tacaacctac cagtccacac ctcttgtaaa 1080 

gcatttgttg catggaaaag ctaagatgac agcaacctgt tcaggaaaac aactgacaag 1140 

gtcataggga gagggagctt ttggaaaggt gccgtgcagt tcaaacaatt agttagcagt 1200 
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agggtgttgg 


tttttgctca 


cagcaataag 


aagttaatca 


tggtgtaggc 


aacccaaata 


1260 


aaacaccaaa 


atatgcacaa 


ggcagtttgt 


cgtattctgt 


agtacagaca 


aaactaaaag 


1320 


taatgaaaga 


agatgtggtg 


ttagaaaagg 


aaacaatatc 


atgagtaatg 


tgtgggcatt 


1380 


atgggaccac 


gaaataaaaa 


gaacattttg 


atgagtcgtg 


tatcctcgat 


gagcctcaaa 


1440 


agttctctca 


ccccggataa 


gaaaccctta 


agcaatgtgc 


aaagtttgca 


ttctccactg 


1500 


acataatgca 


aaataagata 


tcatcgatga 


catagcaact 


catgcatcat 


atcatgcctc 


1560 


tctcaaccta 


ttcattccta 


ctcatctaca 


taagtatctt 


cagctaaatg 


ttagaacata 


1620 


aacccataag 


tcacgtttga 


tgagcattag 


gcgtgacaca 


tgacaaatca 


cagactcaag 


1680 


caagataaag 


caaaatgatg 


tgtacataaa 


actccagagc 


tatatgtcat 


attgcaaaaa 


1740 


gaggagagct 


tataagacaa 


ggcatgactc 


acaaaaa ttc 


atttgccttt 


cgtgtcaaaa 


1800 


agaggagggc 


tttacattat 


ccatgtcata 


ttgcaaaaga 


aagagagaaa 


gaacaacaca 


1860 


atgccgcgtc 


aattatacat 


atctgtatgt 


ccatcat tat 


tcatccacct 


ttcgtgtacc 


1920 


acacttcata 


tatcatgagt 


cacttcatgt 


ctggacatta 


acaaactcta 


tcttaacatt 


1980 


tagatgcaag 


agcctttatc 


tcactataaa 


tgcacgatga 


tttctcattg 


tttctcacaa 


2040 


aaagcattca 


gttcattagt 


cctacaacaa 


c 






2071 



<210> 68 

<2U> 79 

<212> PRT 

<213> Zea mays 

<220> 

<221> SIGNAL 
<222> (1) . . (79) 

<223> Maize waxy signal sequence. 
<400> 68 

Met Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
15 10 15 

Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 

20 25 30 

Leu Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
35 40 45 

Thr Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 
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50 55 60 



Gly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala 
65 70 75 



<210> 69 

<211> 1005 

10 <212> DNA 

<213> Artificial Sequence 



15 



30 



<220> 

<223> Synthetic Bromelain Sequence 



<220> 

<221> CDS 



<222 > (1) . . (1005) 
20 <223> Synthetic Bromelain 

<400> 69 

atg gcc tgg aag gtg cag gtg gtg ttc etc ttc etc ttc etc tgc gtg 48 

Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
25 15 10 15 

atg tgg gcc tec ccg tec gcc gcc tec gcg gac gag ccg tec gac ccg 96 
Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 



atg atg aag cgc ttc gag gag tgg atg gtg gag tac ggc cgc gtg tac 144 
Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 



35 aag gac aac gac gag aag atg cgc cgc ttc cag ate ttc aag aac aac 
Lys Asp Asn Asp Glu Lys Met Arg Arg Phe Gin He Phe Lys Asn Asn 
50 55 60 



192 



gtg aac cac ate gag acc ttc aac tec cgc aac gag aac tec tac ace 240 
40 Val Asn His He Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 
65 70 75 80 

etc ggc ate aac cag ttc acc gac atg acc aac aac gag ttc ate gcc 288 
Leu Gly He Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe He Ala 
45 85 90 95 

cag tac acc ggc ggc ate tec cgc ccg etc aac ate gag cgc gag ccg 336 
Gin Tyr Thr Gly Gly He Ser Arg Pro Leu Asn lie Glu Arg Glu Pro 

100 105 HO 

50 

gtg gtg tec ttc gac gac gtg gac ate tec gcc gtg ccg cag tec ate 384 
Val Val Ser Phe Asp Asp Val Asp He Ser Ala Val Pro Gin Ser lie 
IIS 120 125 

55 gac tgg cgc gac tac ggc gcc gtg acc tec gtg aag aac cag aac ccg 432 
Asp Trp Arg Asp Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 
130 135 140 
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5 



tgc ggc gcc tgc tgg gcc ttc gcc gcc ate gec acc gtg gag tec ate 480 

Cys Gly Ala Cys Trp Ala Phe Ala Ala lie Ala Thr Val Glu Ser lie 
145 150 155 160 

tac aag ate aag aag ggc ate etc gag ccg etc tec gag cag cag gtg 528 

Tyr Lys He Lys Lys Gly He Leu Glu Pro Leu Ser Glu Gin Gin Val 

165 170 175 

10 etc gac tgc gcc aag ggc tac ggc tgc aag ggc ggc tgg gag ttc cgc 576 

Leu Asp Cys Ala Lys Gly Tyr Gly Cys Lys Gly Gly Trp Glu Phe Arg 

180 185 190 

gcc ttc gag ttc ate ate tec aac aag ggc gtg gcc tec ggc gcc ate 624 

15 Ala Phe Glu Phe He He Ser Asn Lys Gly Val Ala Ser Gly Ala He 

195 200 205 

cac ccg tac aag gcc gcc aag ggc acc tgc aag acc gac ggc gtg ccg 672 

Tyr Pro Tyr Lys Ala Ala Lys Gly Thr Cys Lys Thr Asp Gly Val Pro 
20 210 215 220 

aac tec gcc tac ate acc ggc tac gcc cgc gtg ccg cgc aac aac gag 720 

Asn Ser Ala Tyr He Thr Gly Tyr Ala Arg Val Pro Arg Asn Asn Glu 



25 



45 



50 



55 



225 230 235 240 

tec tec atg atg tac gcc gtg tec aag cag ccg ate acc gtg gcc gtg 

Ser Ser Met Met Tyr Ala Val Ser Lys Gin Pro lie Thr Val Ala Val 

245 250 255 



768 



30 gac gcc aac gcc aac ttc cag tac tac aag tec ggc gtg ttc aac ggc B16 

Asp Ala Asn Ala Asn Phe Gin Tyr Tyr Lys Ser Gly Val Phe Asn Gly 

260 265 270 

ccg tgc ggc acc tec etc aac cac gcc gtg acc gcc ate ggc tac ggc 864 

35 Pro Cys Gly Thr Ser Leu Asn His Ala Val Thr Ala He Gly Tyr Gly 

275 280 285 

cag gac tec ate ate tac ccg aag aag tgg ggc gcc aag tgg ggc gag 912 

Gin Asp Ser He He Tyr Pro Lys Lys Trp Gly Ala Lys Trp Gly Glu 
40 290 295 300 

gcc ggc tac ate cgc atg gcc cgc gac gtg tec tec tec tec ggc ate 960 

Ala Gly Tyr lie Arg Met Ala Arg Asp Val Ser Ser Ser Ser Gly He 

305 310 315 320 



tgc ggc ate gcc ate gac ccg etc tac ccg acc etc gag gag tag 1005 
Cys Gly He Ala lie Asp Pro Leu Tyr Pro Thr Leu Glu Glu 

325 330 



<210> 70 
<211> 334 
<212> PRT 



<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
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<400> 70 

Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
5 1 5 10 15 



Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 

10 

Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 

15 

Lys Asp Asn Asp Glu Lys Met Arg Arg Phe Gin lie Phe Lys Asn Asn 
50 55 60 



20 Val Asn His He Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 
65 70 75 80 



Leu Gly He Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe He Ala 
25 85 90 95 



Gin Tyr Thr Gly Gly He Ser Arg Pro Leu Asn He Glu Arg Glu Pro 

100 105 110 

30 

Val Val Ser Phe Asp Asp Val Asp He Ser Ala Val Pro Gin Ser He 
115 120 125 

35 

Asp Trp Arg Asp Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 
130 135 140 



40 Cys Gly Ala Cys Trp Ala Phe Ala Ala lie Ala Thr Val Glu Ser He 
145 150 155 160 



Tyr Lys He Lys Lys Gly He Leu Glu Pro Leu Ser Glu Gin Gin Val 
45 165 170 175 



Leu Asp Cys Ala Lys Gly Tyr Gly Cys Lys Gly Gly Trp Glu Phe Arg 

180 185 190 

50 

Ala Phe Glu Phe He He Ser Asn Lys Gly Val Ala Ser Gly Ala He 
195 200 205 

55 

Tyr Pro Tyr Lys Ala Ala Lys Gly Thr Cys Lys Thr Asp Gly Val Pro 
210 215 220 
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Asn Ser Ala Tyr He Thr Gly Tyr Ala Arg Val Pro Arg Asn Asn Glu 
225 230 235 240 



Ser Ser Met Met Tyr Ala Val Ser Lys Gin Pro He Thr Val Ala Val 

245 250 255 



Asp Ala Asn Ala Asn Phe Gin Tyr Tyr Lys Ser Gly Val Phe Asn Gly 

260 265 270 



Pro Cys Gly Thr Ser Leu Asn His Ala Val Thr Ala He Gly Tyr Gly 
275 280 285 



Gin Asp Ser He He Tyr Pro Lys Lys Trp Gly Ala Lys Trp Gly Glu 
290 295 300 



Ala Gly Tyr He Arg Met Ala Arg Asp Val Ser Ser Ser Ser Gly He 
305 310 315 320 



Cys Gly lie Ala He Asp Pro Leu Tyr Pro Thr Leu Glu Glu 

325 330 



<210> 


71 


<211> 


78 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Bromealin signal sequence 


<400> 


71 



atggcctgga aggtgcaggt ggtgttcctc ttcctcttcc tctgcgtgat gtgggcctcc 60 
ccgtccgccg cctccgcc 78 



<210> 


72 


<211> 


26 


<212> 


PRT 


c213> 


Artificial Sequence 


<220> 




<223> 


Bromealin signal peptide 


<400> 


72 



Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
15 10 15 
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Met Trp Ala Ser Pro Ser Ala Ala Ser Ala 

20 25 

5 

<210> 73 
<211> 1050 
<212> DNA 

<213> Artificial Sequence 

10 

<220> 

<223 > pSYNHOOO 
<400> 73 



15 


atggcctgga 


aggtgcaggt 


ggtgttcctc 


ttcctcttcc 


tetgegtgat 


gtgggcctcc 


60 




ccgtccgccg 


cctccgcgga 


cgagccgtcc 


gacccgatga 


tgaagegett 


cgaggagtgg 


120 


20 


atggtggagt 


acggccgcgt 


gtacaaggac 


aacgacgaga 


agatgegecg 


cttccagatc 


180 


t tcaagaaca 


aegtgaacca 


catcgagacc 


ttcaactccc 


gcaacgagaa 


ctcctacacc 


240 




ctcggca tea 


accag ttcac 


cgacatgacc 


aacaacgagt 


tcatcgccca 


gtacaccggc 


300 


25 


ggcatct ccc 


gcccgctcaa 


catcgagcgc 


gagccggtgg 


tgtccttcga 


cgacgtggac 


360 




atctccgccg 


tgccgcagtc 


catcgactgg 


cgcgactacg 


gcgccgtgac 


ctccgtgaag 


420 


30 


aaccagaacc 


cgtgcggcgc 


ctgctgggcc 


ttcgccgcca 


tcgccaccgt 


ggagtccatc 


480 


tacaagatca 


agaagggcat 


cctcgagccg 


ctctccgagc 


ageaggtget 


cgactgcgcc 


540 




aagggctacg 


getgeaaggg 


eggctgggag 


ttccgcgcct 


tcgagttcat 


catctccaac 


600 


35 


aagggcgtgg 


cctccggcgc 


catctacccg 


tacaaggecg 


ccaagggcac 


ctgcaagacc 


660 




gacggcgtgc 


cgaactccgc 


ctacatcacc 


ggctacgccc 


gcgtgccgcg 


caacaacgag 


720 


40 


tcctccatga 


tgtacgccgt 


gtccaagcag 


ccgatcaccg 


tggccgtgga 


cgccaacgcc 


780 


aacttccagt 


actacaagtc 


cggcgtgttc 


aacggcccgt 


gcggcacctc 


cctcaaccac 


840 




gccgtgaccg 


ccatcggcta 


cggccaggac 


tccatcatct 


acccgaagaa 


gtggggcgcc 


900 


45 


aagtggggcg 


aggceggcta 


catccgcatg 


gcccgcgacg 


tgtcctcctc 


ctccggcatc 


960 




tgeggcateg 


ccatcgaccc 


gctctacccg 


accctcgagg 


aggtgttcgc 


cgaggccatc 


1020 




gccgccaact 


ccaccctcgt 


ggecgagtag 








1050 



50 



<210> 74 

<2l\> 1067 

<212? DNA 

55 <213> Artificial Sequence 

<220> 



244 



5 



15 



25 



35 



40 



45 
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<223> pSYN11589 
<400> 74 



tggcctggaa 


ggtgcaggtg 


gtgttcctct 


tcctcttcct 


ctgcgtgatg 


tgggcctccc 


60 


cgtccgccgc 


ctccgcctcc 


tcctcctcct 


tcgccgactc 


caacccgatc 


cgcccggtga 


120 


ccgaccgcgc 


cgcctccacc 


gaegagcegt 


ccgacccgat 


gatgaagege 


ttcgaggagt 


180 


ggatggtgga 


gtacggccgc 


gtgtacaagg 


acaacgacga 


gaagatgege 


cgcttccaga 


240 


tcttcaagaa 


caacgtgaac 


cacatcgaga 


ccttcaactc 


ccgcaacgag 


aactcctaca 


300 


ccctcggcat 


caaccagttc 


accgacatga 


ccaacaacga 


gttcatcgcc 


cagtacaccg 


360 


gcggca tctc 


ccgcccgctc 


aacatcgagc 


gcgagccggt 


ggtgtccttc 


gacgacgtgg 


420 


acatctccgc 


cgtgccgcag 


t ccatcgact 


ggegegacta 


cggcgccgtg 


acctccgtga 


480 


agaaccagaa 


cccgtgcggc 


gcctgctggg 


ccttcgccgc 


catcgccacc 


gtggagtcca 


540 


tctacaagat 


caagaagggc 


atcctcgagc 


cgctctccga 


gcagcaggtg 


ctcgactgcg 


600 


ccaagggcta 


cggctgcaag 


ggcggctggg 


agttccgcgc 


cttcgagttc 


atcatctcca 


660 


acaagggcgt 


ggcctccggc 


gccatctacc 


egtacaagge 


cgccaagggc 


acctgeaaga 


720 


ccgacggcgt 


gccgaac tec 


gcctacatca 


ccggccacgc 


ccgcgtgccg 




1 ft 0 


agtcctccat 


gatgtacgee 


gtgtccaagc 


agccgatcac 


cgtggccgtg 


gacgccaacg 


840 


ccaacttcca 


gtactacaag 


tccggcgtgt 


tcaacggccc 


gtgcggcacc 


tccctcaacc 


900 


acgccgtgac 


cgccatcggc 


taeggecagg 


actccatcat 


ctacccgaag 


aagtggggcg 


960 


ccaagtgggg 


cgaggccggc 


tacatccgca 


tggcccgcga 


cgtgtcctcc 


tcctccggca 


1020 


tctgcggcat 


cgccatcgac 


ccgctctacc 


cgaccctcga 


ggagtag 




1067 



<210> 75 

<211> 1023 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pSYN11587 Sequence 



<400> 75 

50 atggcctgga aggtgcaggt ggtgttcctc ttcctcttcc tetgegtgat gtgggcctcc 60 

ccgtccgccg cctccgcgga cgagccgtcc gacccgatga tgaagegett cgaggagtgg 120 

atggtggagt acggccgcgt gtacaaggac aacgacgaga agatgegecg cttccagatc 180 

55 

ttcaagaaca aegtgaacca catcgagacc ttcaactccc gcaacgagaa ctcctacacc 240 



245 
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ctcggcatca 


accagttcac 


cgacatgacc 


aacaacgagt 


tcatcgccca 


gtacaccggc 


300 




ggcatctccc 


gcccgctcaa 


catcgagcgc 


gagccggtgg 


tgtccttcga 


cgacgtggac 


360 


5 


atctccgccg 


tgccgcagtc 


catcgactgg 


cgcgactacg 


gcgccgtgac 


ctccgtgaag 


420 




aaccagaacc 


cgtgcggcgc 


ctgctgggcc 


ttcgccgcca 


tcgccaccgt 


ggagtccatc 


480 


10 


tacaagatca 


agaagggcat 


cctcgagccg 


ctctccgagc 


agcaggtgct 


cgactgcgcc 


540 


aagggctacg 


gctgcaaggg 


cggctgggag 


ttccgcgcct 


tcgagttcat 


catctccaac 


600 




aagggcgtgg 


cctccggcgc 


catctacccg 


tacaaggccg 


ccaagggcac 


ctgcaagacc 


660 


15 


gacggcgtgc 


cgaactccgc 


ctacatcacc 


ggctacgccc 


gcgtgccgcg 


caacaacgag 


720 




tec tcca tga 


tgtacgccgt 


gtccaagcag 


ccgatcaccg 


tggccgtgga 


cgccaacgcc 


780 


20 


aact tccagt 


actacaagtc 


cggcgtgttc 


aacggcccgt 


gcggcacctc 


cctcaaccac 


840 


gccgtgaccg 


ccatcggcta 


cggccaggac 


tccat catct 


acccgaagaa 


gcggggcgcc 


o n r\ 






aggccggcta 


catccgcatg 


gcccgcgacg 


tgtcctcctc 


ctccggcatc 


960 


25 


tgcggcatcg 
tag 


ccatcgaccc 


gctctacccg 


accctcgagg 


agtccgagaa 


ggacgagctg 


1020 
1023 


30 


<210> 76 
<211> 990 
<212> DNA 

<213> Artificial Sequence 










35 


<220> 

<223> pSYN12169 Sequence 










40 


<400> 76 
atgagggtgt 


tgctcgttgc 


cctcgctctc 


ctggctctcg 


* 

ctgcgagcgc 


cacctccatg 


60 


gcggacgagc 


cgtccgaccc 


gatgatgaag 


cgcttcgagg 


agtggatggt 


ggagtacggc 


120 




cgcgtgtaca 


aggacaacga 


cgagaagatg 


cgccgcttcc 


agatcttcaa 


gaacaacgtg 


180 


45 


aaccacatcg 


agacct tcaa 


ctcccgcaac 


gagaactcct 


acaccctcgg 


catcaaccag 


240 




ttcaccgaca 


tgaccaacaa 


cgagttcatc 


gcccagtaca 


ccggcggcat 


ctcccgcccg 


300 




ctcaacatcg 


agcgcgagcc 


ggtggtgtcc 


ttcgacgacg 


tggacatctc 


cgccgtgccg 


360 


cagtccatcg 


actggcgcga 


ctacggcgcc 


gtgacctccg 


tgaagaacca 


gaacccgtgc 


420 




ggcgcctgct 


gggccttcgc 


cgccatcgcc 


accgtggagt 


ccatctacaa 


gatcaagaag 


480 


55 


ggcatcctcg 


agccgctctc 


cgagcagcag 


gtgctcgact 


gcgccaaggg 


ctacggctgc 


540 




aagggcggct 


gggagttccg 


cgccttcgag 


t tcatcatct 


ccaacaaggg 


cgtggcctcc 


600 
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ggcgcca tct 


acccgtacaa 


ggccgccaag 


ggcacctgca 


agaccgaegg 


cgtgccgaac 


660 


tccgcctaca 


tcaccggcta 


cgcccgcgtg 


ccgcgcaaca 


acgagtcctc 


catgatgtac 


720 






naccat nacc 


at aa a caeca 
y ^yy a ^yv^a 


araccaactt 

C* M *• a a w w w 


ccaotactac 


780 


aagtccggcg 


tgttcaacgg 


cccgtgcggc 


acctccctca 


accacgccgt 


gaccgccatc 


840 


ggctacggcc 


aggactccat 


catctacccg 


aagaagtggg 


gcgccaagtg 


gggegaggee 


900 


ggctaca tec 


gcatggcccg 


cgacgtgtcc 


tcctcctccg 


geatctgegg 


catcgccatc 


960 


gacccgctct 


acccgaccct 


cgaggagtag 








990 



<210> 77 

<211> 1170 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pSYN12575 Sequence 



< 4 00 > 7 7 

atgctggcgg 


ctctggccac 


gtcgcagctc 


gtcgcaacgc 


gcgccggcct 


gggcgtcccg 


60 


gacgcgtcca 


cgttccgccg 


cggcgccgcg 


cagggectga 


ggggggcccg 


ggcgtcggcg 


120 


geggeggaca 


cgctcagcat 


gcggaccagc 


gcgcgcgcgg 


cgcccaggca 


ccagcaccag 


180 


caggcgcgcc 


gcggggccag 


gttcccgtcg 


ctcgtcgtgt 


gcgccagcgc 


cggcgccatg 


240 


geggacgage 


cgtccgaccc 


gatgatgaag 


cgcttcgagg 


agtggatggt 


ggagtaegge 


300 


cgcgtgtaca 


aggacaacga 


cgagaagatg 


cgccgcttcc 


agatcttcaa 


gaacaacgtg 


360 


aaccacatcg 


agaccttcaa 


ctcccgcaac 


gagaactcct 


acaccctcgg 


catcaaccag 


420 


ttcaccgaca 


tgaccaacaa 


cgagttcatc 


gcccagtaca 


ccggcggcat 


ctcccgcccg 


480 


ctcaacatcg 


agcgcgagcc 


ggtggtgtcc 


ttcgacgacg 


tggacatctc 


cgccgtgccg 


540 


cagtccatcg 


actggegega 


ctacggcgcc 


gtgacctccg 


tgaagaacca 


gaacccgtgc 


600 


ggcgcctgct 


gggccttcgc 


cgccatcgcc 


accgtggagt 


ccatctacaa 


gatcaagaag 


660 


ggcatcctcg 


agccgctctc 


cgagcagcag 


gtgetcgact 


gcgccaaggg 


ctacggctgc 


720 


aagggegget 


gggagttccg 


cgccttcgag 


ttcatcatct 


ccaacaaggg 


cgtggcctcc 


780 


ggcgccatct 


acccgtacaa 


ggccgccaag 


ggcacctgca 


agaccgaegg 


cgtgccgaac 


840 


tccgcctaca 


tcaccggcta 


cgcccgcgtg 


ccgcgcaaca 


acgagtcctc 


catgatgtac 


900 


gccgtgtcca 


ageagecgat 


caccgtggcc 


gtggacgcca 


acgccaactt 


ccagtactac 


960 
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aagtccggcg tgttcaacgg cccgcgcggc 

ggctacggcc aggactccat catctacccg 

5 ggctacatcc gcatggcccg cgacgtgtcc 

gacccgctct acccgaccct cgaggagtag 



10 <210> 78 

<211> 1068 

<212> DNA 

<213> Artificial Sequence 

15 <220> 

<223> pSM270 Sequence 

<400> 78 

atggcccgga aggtgcaggt ggtgtccctc 

20 

ccgcccgccg cctccgcctc ctcctcctcc 

accgaccgcg ccgcctccac cgacgagccg 

25 cggacggtgg agcacggccg cgcgtacaag 

atccccaaga acaacgcgaa ccacatcgag 

acccccggca tcaaccagct caccgacatg 

30 

ggcggcatct cccgcccgct caacatcgag 

gacacccccg ccgtgccgca gtccatcgac 

35 aagaaccaga acccgtgcgg cgcctgctgg 

atccacaaga tcaagaaggg catcctcgag 

gccaagggct acggccgcaa gggcggctgg 

40 

aacaagggcg tggcctccgg cgccatctac 

accgacggcg tgccgaactc cgcctacatc 

45 gagtcctcca tgatgtacgc cgtgtccaag 

gccaacttcc agtactacaa gtccggcgtg 

cacgccgtga ccgccatcgg ctacggccag 

50 

gccaagtggg gcgaggccgg ctacatccgc 

atccgcggca tcgccatcga cccgctctac 

55 

<210> 79 
<211> 1497 



PCT/US2004/007182 



acctccctca 


accacgccgt 


gaccgccatc 


1020 


aagaagcggg 


gcgccaagtg gggcgaggcc 


1080 


tcctcctccg 


gcatctgcgg 


catcgccatc 


1140 






■ 


1170 


ttcctcttcc 


cc cgcg cga c 


9 cgggccccc 


60 


ttcgccgact 


ccaacccgac 


/■« p* i <*■ • /•■* p <*i <"¥ ^ f"i 

ccycctyy 


120 


tccgacccga 


cga cgaagcg 




180 


gacaacgacg 


agaaga eg eg 


/i c* 1~ ^ f Sk n 

ccyct tccay 


240 


accttcaact 


cccgcaacga 


yddCCCCtdC 


300 


accaacaacg 


aguLcaccgc 


p p a /"*• t~ a r* a f* r* 
vLdy LaLdtv 


360 


cgcgagccgg 


cggcg ucct t 


/"•rta /^rta rrih o 

eg a cga eg eg 


420 


tggcgcgact 


acggcgccg t 


ydtLH»ty iy 


480 


gccttcgccg 


CCalCyCCaC 


r+nt" rit-% a n f*p> 


540 


ccgctctccg 


agcagcagg c 


yctcydwLyv. 


600 


gagttccgcg 


CCCCCyay t C. 




660 


ccgtacaagg 


ccgccaaygg 


Latt, ty taay 


720 


accggctacg 


cccgcgtgcc 


gcgcaacaac 


780 


cagccgatca 


ccgtggccgt 


ggacgccaac 


840 


ttcaacggcc 


cgtgcggcac 


ctccctcaac 


900 


gactccatca 


tctacccgaa 


gaagegggge 


960 


atggcccgcg 


acgtgtcctc 


ctcctccggc 


1020 


ccgaccctcg 


aggagcag 




1068 
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<212> DNA 

<213> Trichoderma reesei 



5 <220> 

<221> CDS 

<222> (1) . . (1497) 

<223> Trichoderma reesei cellobiohyrodlase I 
10 <400? 79 

atg cag teg gcg tgt act etc caa teg gag act cac ccg cct ctg aca 48 
Met Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
15 10 15 

> 

15 tgg cag aaa tgc teg tct ggt ggc acg tgc act caa cag aca ggc tec 96 
Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

20 25 30 

gtg gtc ate gac gee aac tgg cgc tgg act cac get acg aac age age 144 
20 Val Val lie Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 

35 40 45 

acg aac tgc tac gat ggc aac act tgg age teg acc eta tgt cct gac 192 
Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
25 50 55 60 

aac gag acc tgc gcg aag aac tgc tgt ctg gac ggt gec gee tac gcg 240 
Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
65 70 75 80 

tec acg tac gga gtt acc acg age ggt aac age etc tec att ggc ttt 288 
Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser He Gly Phe 

85 90 95 

35 gtc acc cag tct gcg cag aag aac gtt ggc get cgc ctt tac ctt atg 336 
Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 

100 105 110 



30 



gcg age gac acg acc tac cag gaa ttc acc ctg ctt ggc aac gag ttc 
40 Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 

115 120 125 



384 



tct ttc gat gtt gat gtt teg cag ctg ccg tgc ggc ttg aac gga get 432 

Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
45 130 135 140 

etc tac ttc gtg tec atg gac gcg gat ggt ggc gtg age aag tat ccc 480 

Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
145 150 155 160 

50 

acc aac acc get ggc gee aag tac ggc acg ggg tac tgt gac age cag 528 

Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 170 175 

55 tgt ccc cgc gat ctg aag ttc ate aat ggc cag gec aac gtt gag ggc 576 

Cys Pro Arg Asp Leu Lys Phe He Asn Gly Gin Ala Asn Val Glu Gly 

180 185 190 
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tgg gag ccg tea tec aac aac gcg aac acg ggc att gga gga cac gga 624 

Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly He Gly Gly His Gly 

195 200 205 

5 

age tgc tgc tct gag atg gat ate tgg gag gee aac tec ate tec gag 672 

Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser He Ser Glu 

210 215 220 

10 get ctt ace ccc cac cct tgc acg act gtc ggc cag gag ate tgc gag 720 

Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu He Cys Glu 

225 230 235 240 

ggt gat ggg tgc ggc gga act tac tec gat aac aga tat ggc ggc act 768 

15 Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 

245 250 255 



tgc gat ccc gat ggc tgc gac tgg aac cca tac cgc ctg ggc aac ace 
Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 
20 260 265 270 



25 



45 



50 aac gag acc tec tec aca ccc ggt gee gtg cgc gga age tgc tec ace 

Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 

385 390 395 400 

age tec ggt gtc cct get cag gtc gaa tct cag tct ccc aac gee aag 

55 Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 

405 410 415 



816 



age ttc tac ggc cct ggc tct age ttt acc etc gat acc acc aag aaa 864 
Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
275 280 285 

ttg acc gtt gtc acc cag ttc gag acg teg ggt gee ate aac cga tac 912 
Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala He Asn Arg Tyr 
290 295 300 

30 tat gtc cag aat ggc gtc act ttc cag cag ccc aac gee gag ctt ggt 960 
Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 
305 310 315 320 

agt tac tct ggc aac gag etc aac gat gat tac tgc aca get gag gag 1008 
35 Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 

325 330 335 

gca gaa ttc ggc gga tec tct ttc tea gac aag ggc ggc ctg act cag 1056 
Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 
40 340 345 350 

ttc aag aag get acc tct ggc ggc atg gtt ctg gtc atg agt ctg tgg 1104 
Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
355 360 365 



gat gat tac tac gee aac atg ctg tgg ctg gac tec acc tac ccg aca 1152 
Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
370 375 380 



1200 



1248 
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gtc acc ttc tec aac ate aag ttc gga ccc att ggc age ace ggc aac 1296 

Val Thr Phe Ser Asn lie Lys Phe Gly Pro He Gly Ser" Thr Gly Asn 

420 425 430 

5 cct age ggc ggc aac cct ccc ggc gga aac ccg cct ggc acc acc acc 1344 

Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
435 440 445 

acc cgc cgc cca gee act acc act gga age tct ccc gga cct acc cag 1392 

10 Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 

450 455 460 

tct cac tac ggc cag tgc ggc ggt att ggc tac age ggc ccc acg gtc 1440 

Ser His Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val 

15 465 470 475 480 

tgc gec age ggc aca act tgc cag gtc ctg aac cct tac tac tct cag 1488 

Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 

485 490 495 



20 



25 



30 



35 



40 



55 



tgc ctg taa 1497 

Cys Leu 



<210> 80 

<211> 498 

<212> PRT 

<213> Trichoderma reesei 

<400> 80 



Met Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
15 10 15 



Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

20 25 30 



Val Val He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 

35 40 45 



45 Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
50 55 60 



Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
50 65 70 75 80 



Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser He Gly Phe 

85 90 95 



Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 
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100 



105 



110 



Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
5 115 120 125 



10 



15 



Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
130 135 140 



Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
145 150 155 160 



Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 170 175 



20 Cys Pro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Val Glu Gly 

180 185 190 



Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly He Gly Gly His Gly 
25 195 200 205 



Ser Cys Cys Ser Glu Met Asp He Trp Glu Ala Asn Ser He Ser Glu 
210 215 220 



30 



35 



Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu He Cys Glu 
225 230 235 240 



Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 

245 250 255 



40 Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 

260 265 270 



Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
45 275 280 285 



Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala He Asn Arg Tyr 
290 295 300 



50 



Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 
305 310 315 320 



55 



Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 

325 330 335 
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Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 

340 345 350 

5 

Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
355 360 365 

10 

Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
370 375 380 



15 Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 
385 390 395 400 



Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 
20 405 410 415 



Val Thr Phe Ser Asn He Lys Phe Gly Pro He Gly Ser Thr Gly Asn 

420 425 430 

25 

Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
435 440 445 

30 

Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 
450 455 460 



35 Ser His Tyr Gly Gin Cys Gly Gly He Gly Tyr Ser Gly Pro Thr Val 
465 470 475 480 



Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 
40 485 490 495 



Cys Leu 

45 

<210> 81 
<211> 1365 
<212> DMA 
50 <213> Trichoderma reesei 



<220> 

<221> CDS 

55 <222> (1) . . (1365) 

<223> trichoderma reesei eel lobiohydrolase II 



253 
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<400> 81 

at 9 9^9 cct cta 9 a 9 9 a 9 c 99 caa 9 ct t9 c tca a 9 c 9 tc fc 99 99 c caa 48 

Met Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 

15 10 15 

tgt ggt ggc cag aat tgg teg ggt ccg act tgc tgt get tec gga age 96 

Cys Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 

20 25 30 

10 aca tgc gtc tac tec aac gac tat tac tec cag tgt ctt ccc ggc get 144 

Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 

35 40 45 

gca age tea age teg tec acg cgc gee gcg teg acg act tea cga gta 192 

15 Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 

50 55 60 

tec ccc aca aca tec egg teg age tec gcg acg cct cca cct ggt tct 240 

Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 

20 65 70 75 80 

ace act ace aga gta cct cca gtc gga teg gga acc get acg tat tea 288 

Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 

85 90 95 

25 

ggc aac cct ttt gtt ggg gtc act cct tgg gee aat gca tat tac gee 336 

Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 110 

30 tct gaa gtt age age etc get att cct age ttg act gga gee atg gee 384 

Ser Glu Val Ser Ser Leu Ala He Pro Ser Leu Thr Gly Ala Met Ala 

115 120 125 

act get gca gca get gtc gca aag gtt ccc tct ttt atg tgg cta gat 432 

35 Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 

130 135 140 

act ctt gac aag acc cct etc atg gag caa acc ttg gee gac ate cgc 480 

Thr Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp He Arg 

40 145 150 155 160 

acc gee aac aag aat ggc ggt aac tat gec gga cag ttt gtg gtg tat 528 

Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 170 175 

45 

gac ttg ccg gat cgc gat tgc get gee ctt gee teg aat ggc gaa tac 576 

Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

180 185 190 

50 tct att gee gat ggt ggc gtc gee aaa tat aag aac tat ate gac acc 624 

Ser He Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr He Asp Thr 

195 200 205 

att cgt caa att gtc gtg gaa tat tec gat ate egg acc etc ctg gtt 672 

55 He Arg Gin He Val Val Glu Tyr Ser Asp lie Arg Thr Leu Leu Val 

210 215 220 



254 



WO 2005/096804 



PCT/US2004/007182 



20 



act gag cct gac tct ctt gcc aac ctg gtg acc aac etc ggt act cca 720 
He Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr Pro 
225 230 235 240 

5 aag tgt gcc aat get cag tea gcc tac ctt gag tgc ate aac tac gcc 768 
Lys Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys He Asn Tyr Ala 

245 250 255 

gtc aca cag ctg aac ctt cca aat gtt gcg atg tat ttg gac get ggc 816 
10 Val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly 

260 265 270 

cat gca gga tgg cct ggc tgg ccg gca aac caa gac ccg gcc get cag 864 
His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gin Asp Pro Ala Ala Gin 
15 275 280 285 

cca ttt gca aat gcc cac aag aat gca ccg cct ccg aga get ctt cgc 912 
Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala Leu Arg 
290 295 300 

gga ttg gca acc aat gtc gcc aac tac aac ggg tgg aac att acc age 960 
Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn He Thr Ser 
305 310 315 320 

25 ccc cca teg tac acg caa ggc aac get gtc tac aac gag aag ctg tac 1008 
Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys Leu Tyr 

325 330 335 

ate cac get att gga cct ctt ctt gcc aat cac ggc tgg tec aac gcc 1056 
30 He His Ala He Gly Pro Leu Leu Ala Asn His Gly Trp Ser Asn Ala 

340 345 350 

ttc ttc ate act gat caa ggt cga teg gga aag cag cct acc gga cag 1104 
Phe Phe He Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin 
35 355 360 365 

caa cag tgg gga gac tgg tgc aat gtg ate ggc acc gga ttt ggt att 1152 
Gin Gin Trp Gly Asp Trp Cys Asn Val He Gly Thr Gly Phe Gly He 
370 375 380 

40 

cgc cca tec gca aac acc ggg gac teg ttg ctg gat teg ttt gtc tgg 1200 
Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val Trp 
385 390 395 400 

45 gtc aag cca ggc ggc gag tgt gac ggc acc age gac age agt gcg cca 1248 
Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala Pro 

405 410 415 

cga ttt gac tec cac tgt gcg etc cca gat gcc ttg caa ccg gcg cct 1296 
50 Arg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro Ala Pro 

420 425 430 

caa get ggt get tgg ttc caa gcc tac ttt gtg cag ctt etc aca aac 1344 
Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 
55 435 440 445 

gca aac cca teg ttc ctg tag 1365 
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Ala Asn Pro Ser Phe Leu 
450 



5 <210> 82 
<211> 454 
<212> PRT 

<213> Trichoderma reesei 
10 <400> 82 

Met Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 
15 10 15 

15 

Cys Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 

20 25 30 



20 Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 

35 40 45 



Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 
25 50 55 60 



Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 
65 70 75 80 

30 

Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 

85 90 95 

35 

Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 110 



40 Ser Glu Val Ser Ser Leu Ala He Pro Ser Leu Thr Gly Ala Met Ala 

115 120 125 



Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 
45 130 135 140 



Thr Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp He Arg 
145 150 155 160 

50 

Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 170 175 

55 

Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

180 185 190 
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Ser lie Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr lie Asp Thr 
195 200 205 

5 

lie Arg Gin He Val Val Glu Tyr Ser Asp He Arg Thr Leu Leu Val 
210 215 220 

10 • 

He Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr Pro 
225 230 235 240 



15 Lys Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys He Asn Tyr Ala 

245 250 255 



Val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly 
20 260 265 270 



His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gin Asp Pro Ala Ala Gin 
275 280 285 

25 

Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala Leu Arg 
290 295 300 

30 

Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn He Thr Ser 
305 310 315 320 



35 Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys Leu Tyr 

325 330 335 



He His Ala He Gly Pro Leu Leu Ala Asn His Gly Trp Ser Asn Ala 
40 340 345 350 



Phe Phe He Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin 
355 360 365 

45 

Gin Gin Trp Gly Asp Trp Cys Asn Val He Gly Thr Gly Phe Gly He 
370 375 380 

50 

Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val Trp 
385 390 395 400 



55 Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala Pro 

405 410 415 
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Arg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro Ala Pro 

420 425 430 



Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 

435 440 445 



10 Ala Asn Pro Ser Phe Leu 

450 



<210> 83 

15 <211> 1317 

<212> DNA 

<2 13 > Trichoderma reesei 



20 <220> 

<22\> CDS 

<222> (1) . . (1317) 

< 2 2 3 > Trichoderma reesei endoglucanase I 
25 <400> 83 

atg cag caa ccg gga acc age acc ccc gag gtc cat ccc aag ttg aca 48 
Met Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 
15 10 15 

30 acc tac aag tgc aca aag tec ggg ggg tgc gtg gec cag gac acc teg . 96 
Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 30 

gtg gtc ctt gac tgg aac tac cgc tgg atg cac gac gca aac tac aac 144 
35 Val Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 

35 40 45 

teg tgc acc gtc aac ggc ggc gtc aac acc acg etc tgc cct gac gag 192 
Ser Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu 
40 50 55 60 



45 



gcg acc tgt ggc aag aac tgc ttc ate gag ggc gtc gac tac gec gec 

Ala Thr Cys Gly Lys Asn Cys Phe He Glu Gly Val Asp Tyr Ala Ala 

65 70 75 80 

teg ggc gtc acg acc teg ggc age age etc acc atg aac cag tac atg 

Ser Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

85 90 95 



240 



288 



50 ccc age age tct ggc ggc tac age age gtc tct cct egg ctg tat etc 336 

Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 105 110 

ctg gac tct gac ggt gag tac gtg atg ctg aag etc aac ggc cag gag 384 

55 Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 

115 120 125 
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ctg age ttc gac gtc gac etc tct get ctg ccg tgt gga gag aac ggc 432 

Leu Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly 
130 135 140 

5 teg etc tac ctg tct cag atg gac gag aac ggg ggc gec aac cag tat 480 

Ser Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr 
145 150 155 160 

aac acg gec ggt gec aac tac ggg age ggc tac tgc gat get cag tgc 528 

10 Asn Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys 

165 170 175 

ccc gtc cag aca tgg agg aac ggc acc etc aac act age cac cag ggc 576 

Pro Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 

!5 180 185 190 

ttc tgc tgc aac gag atg gat ate ctg gag ggc aac teg agg gcg aat 624 

Phe Cys Cys Asn Glu Met Asp lie Leu Glu Gly Asn Ser Arg Ala Asn 
1S5 200 205 

20 

gec ttg acc cct cac tct tgc acg gee acg gee tgc gac tct gec ggt 672 

Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly 
210 215 220 

25 tgc ggc ttc aac ccc tat ggc age ggc tac aaa age tac tac ggc ccc 720 

Cys Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro 
225 230 235 240 

gga gat acc gtt gac acc tec aag acc ttc acc ate ate acc cag ttc 768 

30 Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr lie He Thr Gin Phe 

245 250 255 

aac acg gac aac ggc teg ccc teg ggc aac ctt gtg age ate acc cgc 816 

Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser He Thr Arg 

35 260 265 270 

aag tac cag caa aac ggc gtc gac ate ccc age gec cag ccc ggc ggc 864 

Lys Tyr Gin Gin Asn Gly Val Asp He Pro Ser Ala Gin Pro Gly Gly 
275 280 285 

AO 

gac acc ate teg tec tgc ccg tec gec tea gec tac ggc ggc etc gee 912 

Asp Thr He Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala 
290 295 300 

45 acc atg ggc aag gec ctg age age ggc atg gtg etc gtg ttc age att 960 

Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val Leu Val Phe Ser He 
305 310 315 320 

tgg aac gac aac age cag tac atg aac tgg etc gac age ggc aac gec 10.08 

50 Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala 

325 330 335 

ggc ccc tgc age age acc gag ggc aac cca tec aac acc ctg gec aac 1056 

Gly Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn Thr Leu Ala Asn 

55 340 345 350 

aac ccc aac acg cac gtc gtc ttc tec aac ate cgc tgg gga gac att 1104 



259 



WO 2005/096804 PCT/US2004/007182 

Asn Pro Asn Thr His Val Val Phe Ser Asn lie Arg Trp Gly Asp lie 
355 360 365 

ggg tct act acg aac teg act gcg ccc ccg ccc ccg cct gcg tec age 1152 
5 Gly Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 
370 375 380 

acg acg ttt teg act aca egg agg age teg acg act teg age age ccg 1200 
Thr Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 
10 385 390 395 400 

age tgc acg cag act cac tgg ggg cag tgc ggt ggc att ggg tac age 1248 

Ser Cys Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser 

405 410 415 

15 

ggg tgc aag acg tgc acg teg ggc act acg tgc cag tat age aac gac 1296 

Gly Cys Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 

420 425 430 

20 tac tac teg caa tgc ctt tag 1317 
Tyr Tyr Ser Gin Cys Leu 
435 



25 <210> 84 

<211> 438 

<212> PRT 

<213> Trichoderma reesei 

30 <400> 84 

Met Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 
1 5 10 15 



35 



Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 30 



40 Val Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 

35 40 45 



Ser Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu 
45 50 55 60 



Ala Thr Cys Gly Lys Asn Cys Phe He Glu Gly Val Asp Tyr Ala Ala 
65 70 75 80 

50 

Ser Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

85 90 95 

55 

Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 105 110 
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Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 
H* 120 125 



Leu Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly 
130 135 140 



Ser Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr 
145 150 155 " 160 



Asn Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys 

165 170 175 



Pro Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 

180 185 190 



Phe Cys Cys Asn Glu Met Asp He Leu Glu Gly Asn Ser Arg Ala Asn 
195 200 205 



Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly 
210 215 220 



Cys Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro 
225 230 235 240 



Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr He He Thr Gin Phe 

245 250 255 



Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser He Thr Arg 

260 265 '* 270 



Lys Tyr Gin Gin Asn Gly Val Asp He Pro Ser Ala Gin Pro Gly Gly 
275 280 285 



Asp Thr He Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala 
290 295 300 



Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val Leu Val Phe Ser He 
305 310 315 320 



Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala 

325 330 335 
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5 



20 



25 



Gly Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn Thr -Leu Ala Asn 

340 345 350 



Asn Pro Asn Thr His Val Val Phe Ser Asn lie Arg Trp Gly Asp He 
355 360 365 



10 Gly Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 
370 375 380 



Thr Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 
15 335 390 395 400 



Ser Cys Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser 

405 410 415 



Gly Cys Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 

4 20 425 430 



Tyr Tyr Ser Gin Cys Leu 
435 



30 <210> 85 

<211> 954 

<212> DNA 

<213> Artificial Sequence 

35 <220> 

<223> 6GP1 



<220> 

40 <221> CDS 

<222> (1) . . (954 ) 

<223> 6GP1 



c400> 85 

45 atg ggc gtg gac ccg ttc gag cgc aac aag ate etc ggc cgc ggc ate 

Met Gly Val Asp Pro Phe Glu Arg Asn Lys He Leu Gly Arg Gly He 

15 10 15 

aac ate ggc aac gee ctg gag gee ccg aac gag ggc gac tgg ggc gtg 

50 Asn He Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

20 25 30 



48 



96 



gtg ate aag gac gag ttc ttc gac ate ate aag gag gee ggc ttc tec 144 

Val He Lys Asp Glu Phe Phe Asp He He Lys Glu Ala Gly Phe Ser 
55 35 40 45 

cac gtg cgc ate ccg ate cgc tgg tec ace cac gee tac gee ttc ccg 192 
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His Val Arg lie Pro lie Arg Trp Ser Thr His Ala Tyr Ala Phe Pro 
50 55 60 

ccg tac aag ate atg gac cgc ttc ttc aag cgc gtg gac gag gtg ate 240 

5 Pro Tyr Lys lie Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val lie 
65 70 75 80 

aac ggc gec etc aag cgc ggc etc gec gtg gee ate aac ate cac cac 288 

Asn Gly Ala Leu Lys Arg Gly Leu Ala Val Ala lie Asn lie His His 

10 85 90 95 

tac gag gag etc atg aac gac ccg gag gag cac aag gag cgc ttc etc 336 

Tyr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu 

100 105 110 

15 

gec etc tgg aag cag ate gee gac cgc tac aag gac tac ccg gag acc 384 

Ala Leu Trp Lys Gin lie Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr 

115 120 125 

20 etc ttc ttc gag ate etc aac gag ccg cac ggc aac etc acc ccg gag 432 

Leu Phe Phe Glu lie Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu 
130 135 140 

aag tgg aac gag ctg etc gag gag gee etc aag gtg ate cgc tec ate 480 

25 Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val He Arg Ser He 
145 150 155 160 

gac aag aag cac acc ate ate att ggc acc gca gag tgg gga ggc ate 528 

Asp Lys Lys His Thr He He lie Gly Thr Ala Glu Trp Gly Gly He 

30 165 170 175 

tec gec etc gag aag etc tec gtg ccg aag tgg gag aag aat tec ate 576 

Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn Ser He 

180 185 190 

35 

gtg acc ate cac tac tac aac ccg ttc gag ttc acg cac cag ggc gee 624 

Val Thr He His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gin Gly Ala 

195 200 205 

40 gag tgg gtg gag ggc tec gag aag tgg ctt ggc cgc aag tgg ggc tec 672 

Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser 
210 215 220 

ccg gac gac cag aag cac etc ate gag gag ttc aac ttc ate gag gag 720 

45 Pro Asp Asp Gin Lys His Leu He Glu Glu Phe Asn Phe He Glu Glu 
225 230 235 240 

tgg tec aag aag aac aag cgc ccg ate tac ate ggc gag ttt ggc gec 768 

Trp Ser Lys Lys Asn Lys Arg Pro He Tyr lie Gly Glu Phe Gly Ala 

50 245 250 255 

tac cgc aag gee gac etc gag tec cgc ate aag tgg acc tec ttc gtg 816 

Tyr Arg Lys Ala Asp Leu Glu Ser Arg He Lys Trp Thr Ser Phe Val 

260 265 270 



55 



gtg cgt gag atg gag aag cgc cgc tgg tec tgg gee tac tgg gag ttc 864 
val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe 
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275 280 285 

tgc tec ggc ttc ggc gtg tac gac acc etc cgc aag ace tgg aac aag 912 
Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 
5 290 295 300 

gac etc etc gag gee etc ate ggc ggc gac tec ate gag tag 954 
Asp Leu Leu Glu Ala Leu lie Gly Gly Asp Ser lie Glu 
305 310 315 

10 

<210> 86 
<211> 317 
<212> PRT 
15 <213> Artificial Sequence 

<220> 

<223> Synthetic Construct 
20 <400> 86 

Met Gly Val Asp Pro Phe Glu Arg Asn Lys He Leu Gly Arg Gly He 
1 5 10 15 

Asn He Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

20 25 30 

30 Val He Lys Asp Glu Phe Phe Asp He He Lys Glu Ala Gly Phe Ser 

35 40 45 

His Val Arg He Pro He Arg Trp Ser Thr His Ala Tyr Ala Phe Pro 

35 50 55 60 

Pro Tyr Lys He Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val He 
65 70 75 80 

40 

Asn Gly Ala Leu Lys Arg Gly Leu Ala Val Ala He Asn He His His 

85 90 95 

45 

Tyr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu 

100 105 110 



50 Ala Leu Trp Lys Gin He Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr 

115 120 125 



Leu Phe Phe Glu He Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu 
55 130 135 140 



264 



WO 2005/096804 PCT/US2004/007182 

Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val He Arg Ser He 
145 150 155 160 



5 Asp Lys Lys His Thr lie He He Gly Thr Ala Glu Trp Gly Gly He 

165 170 175 



Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn Ser He 
10 180 185 190 



Val Thr lie His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gin Gly Ala 
195 200 205 

15 

Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser 
210 215 220 

20 

Pro Asp Asp Gin Lys His Leu lie Glu Glu Phe Asn Phe lie Glu Glu 
225 230 235 240 



25 Trp Ser Lys Lys Asn Lys Arg Pro lie Tyr lie Gly Glu Phe Gly Ala 

245 250 255 



Tyr Arg Lys Ala Asp Leu Glu Ser Arg lie Lys Trp Thr Ser Phe Val 
30 260 265 270 



Val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe 
275 280 285 

35 

Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 
290 295 300 

40 

Asp Leu Leu Glu Ala Leu lie Gly Gly Asp Ser He Glu 
305 310 315 



45 <210> 87 

<211> 1248 

<212> DNA 

<213> Hordeum vulgare 

50 

<220> 

<221> CDS 

<222> (1) . . (1248) 

<223> Barley Amyl amylase 

55 

<400> 87 

atg gca cac caa gtc etc ttt cag ggg ttc aac tgg gag teg tgg aag 48 
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Met Ala His Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys 
15 10 15 

cag age ggc ggg tgg tac aac atg atg atg ggc aag gtc gac gac ate 96 
5 Gin Ser Gly Gly Trp Tyr Asn Met Met Met Gly Lys Val Asp Asp lie 

20 25 30 

gee get gee gga gtc ace cac gtc tgg ctg cca ccg ccg teg cac tec 144 
Ala Ala Ala Gly Val Thr His Val Trp Leu Pro Pro Pro Ser His Ser 
10 35 40 45 

gtc tec aac gaa ggt tac atg cct ggt egg ctg tac gac ate gac gcg 192 
Val Ser Asn Glu Gly Tyr Met Pro Gly Arg Leu Tyr Asp lie Asp Ala 
50 55 60 

15 

tec aag tac ggc aac gcg gcg gag etc aag teg etc ate ggc gcg etc 240 
Ser Lys Tyr Gly Asn Ala Ala Glu Leu Lys Ser Leu lie Gly Ala Leu 
65 70 75 80 

20 cac ggc aag ggc gtg cag gee ate gec gac ate gtc ate aac cac cgc 288 
His Gly Lys Gly Val Gin Ala He Ala Asp He Val He Asn His Arg 

85 90 95 

tgc gec gac tac aag gat age cgc ggc ate tac tgc ate ttc gag ggc 336 
25 Cys Ala Asp Tyr Lys Asp Ser Arg Gly He Tyr Cys He Phe Glu Gly 

100 105 110 

ggc ace tec gac ggc cgc etc gac tgg ggc ccc cac atg ate tgt cgc 384 
Gly Thr Ser Asp Gly Arg Leu Asp Trp Gly Pro His Met He Cys Arg 
30 115 120 125 

gac gac ace aaa tac tec gat ggc ace gca aac etc gac ace gga gec 432 
Asp Asp Thr Lys Tyr Ser Asp Gly Thr Ala Asn Leu Asp Thr Gly Ala 
130 135 140 

35 

gac ttc gee gee gcg ccc gac ate gac cac etc aac gac egg gtc cag 480 
Asp Phe Ala Ala Ala Pro Asp He Asp His Leu Asn Asp Arg Val Gin 
145 150 155 160 

40 cgc gag etc aag gag tgg etc etc tgg etc aag age gac etc ggc ttc 528 
Arg Glu Leu Lys Glu Trp Leu Leu Trp Leu Lys Ser Asp Leu Gly Phe 

165 170 175 

gac gcg tgg cgc ctt gac ttc gee agg ggc tac teg ccg gag atg gec 576 
45 Asp Ala Trp Arg Leu Asp Phe Ala Arg Gly Tyr Ser Pro Glu Met Ala 

180 185 190 



aag gtg tac ate gac ggc aca tec ccg age etc gee gtg gee gag gtg 
Lys Val Tyr He Asp Gly Thr Ser Pro Ser Leu Ala Val Ala Glu Val 
50 195 200 205 



624 



55 



tgg gac aat atg gee acc ggc ggc gac ggc aag ccc aac tac gac cag 672 

Trp Asp Asn Met Ala Thr Gly Gly Asp Gly Lys Pro Asn Tyr Asp Gin 

210 215 220 

gac gcg cac egg cag aat ctg gtg aac tgg gtg gac aag gtg ggc ggc 720 

Asp Ala His Arg Gin Asn Leu Val Asn Trp Val Asp Lys Val Gly Gly 



266 



WO 2005/09(5804 



PCT/US2004/007182 



10 



225 230 235 240 

gcg gcc teg gca ggc atg gtg ttc gac ttc acg acc aaa ggg ata ctg 768 
Ala Ala Ser Ala Gly Met Val Phe Asp Phe Thr Thr Lys Gly lie Leu 

245 250 255 

aac get gcc gtg gag ggc gag ctg tgg agg ctg ate gac ccg cag ggg 816 
Asn Ala Ala Val Glu Gly Glu Leu Trp Arg Leu lie Asp Pro Gin Gly 

260 265 270 

aag gcc ccc ggc gtg atg gga tgg tgg ccg gcc aag gcc gtc acc ttc 864 
Lys Ala Pro Gly Val Met Gly Trp Trp Pro Ala Lys Ala Val Thr Phe 
275 280 285 

15 gtc gac aac cac gat aca ggc tec acg cag gcc atg tgg cca ttc ccc 912 
Val Asp Asn His Asp Thr Gly Ser Thr Gin Ala Met Trp Pro Phe Pro 
290 295 300 

tec gac aag gtc atg cag ggc tac gcg tac ate etc acc cac ccc ggc 960 
20 Ser Asp Lys Val Met Gin Gly Tyr Ala Tyr He Leu Thr His Pro Gly 
305 310 315 320 

ate cca tgc ate ttc tac gac cat ttc ttc aac tgg ggg ttt aag gac 1008 
He Pro Cys He Phe Tyr Asp His Phe Phe Asn Trp Gly Phe Lys Asp 
25 325 330 335 

cag ate gcg gcg ctg gtg gcg ate agg aag cgc aac ggc ate acg gcg 1056 
Gin lie Ala Ala Leu Val Ala lie Arg Lys Arg Asn Gly He Thr Ala 

340 345 350 

acg age get ctg aag ate etc atg cac gaa gga gat gcc tac gtc gcc 1104 
Thr Ser Ala Leu Lys He Leu Met His Glu Gly Asp Ala Tyr Val Ala 
355 360 365 

35 gag ata gac ggc aag gtg gtg gtg aag ate ggg tec agg tac gac gtc 1152 
Glu He Asp Gly Lys Val Val Val Lys He Gly Ser Arg Tyr Asp Val 
370 375 380 

ggg gcg gtg ate ccg gcc ggg ttc gtg acc teg gca cac ggc aac gac 1200 
40 Gly Ala Val lie Pro Ala Gly Phe Val Thr Ser Ala His Gly Asn Asp 
385 390 395 400 

tac gcc gtc tgg gag aag aac ggt gcc gcg gca aca eta caa egg age 1248 
Tyr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 
45 405 410 415 



<210> 88 

<2ll> 416 

50 <212> PRT 

<213> Hordeum vulgare 

<400> 88 

55 Met Ala His Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys 
15 10 15 
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Gin Ser 



5 

Ala Ala 



10 Val Ser 
50 



Ser Lys 
15 65 



His Gly 

20 

Cys Ala 



25 

Gly Thr 



30 Asp Asp 
130 



Asp Phe 
35 145 



Arg Glu 

40 

Asp Ala 



45 

Lys Val 



50 Trp Asp 
210 



Asp Ala 
55 225 



Gly Gly Trp 
20 



Ala Gly Val 
35 



Asn Glu Gly 



Tyr Gly Asn 



Lys Gly Val 
85 



Asp Tyr Lys 
100 



Ser Asp Gly 
115 



Thr Lys Tyr 



Ala Ala Ala 



Leu Lys Glu 
165 



Trp Arg Leu 
180 



Tyr lie Asp 
195 



Asn Met Ala 



His Arg Gin 



Tyr Asn Met 



Thr His Val 
40 



Tyr Met Pro 
55 



Ala Ala Glu 
70 



Gin Ala He 



Asp Ser Arg 



Arg Leu Asp 
120 



Ser Asp Gly 
135 



Pro Asp 
150 



Trp Leu 



Asp Phe 



Gly Thr 



Thr Gly 
215 



Asn Leu 
230 



Met Met 
25 

Trp Leu 
Gly Arg 
Leu Lys 



Ala Asp 
90 



Gly He 
105 



Trp Gly 



Thr Ala 



He Asp His 



Leu Trp Leu 
170 

Ala Arg Gly 
185 



Ser Pro Ser 
200 



Gly Asp Gly 



Val Asn Trp 



Gly Lys Val 



Pro Pro Pro 
45 



Leu Tyr Asp 
60 



Ser Leu He 
75 



He Val He 



Tyr Cys He 



Pro His Met 
125 



Asn Leu Asp 
140 



Leu Asn Asp 
155 



Lys Ser Asp 



Tyr Ser Pro 



Leu Ala Val 
205 



Lys Pro Asn 
220 



Val Asp Lys 
235 



Asp Asp He 
30 



Ser His Ser 



He Asp Ala 



Gly Ala Leu 
80 



Asn His Arg 
95 



Phe Glu Gly 
110 



He Cys Arg 



Thr Gly Ala 



Arg val Gin 
160 



Leu Gly Phe 
175 



Glu Met Ala 
190 



Ala Glu Val 



Tyr Asp Gin 



Val Gly Gly 
240 
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Ala Ala Ser Ala Gly Met Val Phe Asp Phe Thr Thr Lys Gly He Leu 

245 250 255 



5 Asn Ala Ala Val Glu Gly Glu Leu Trp Arg Leu He Asp Pro Gin Gly 

260 265 270 



Lys Ala Pro Gly Val Met Gly Trp Trp Pro Ala Lys Ala Val Thr Phe 
10 275 280 285 



Val Asp Asn His Asp Thr Gly Ser Thr Gin Ala Met Trp Pro Phe Pro 
290 295 300 

15 

Ser Asp Lys Val Met Gin Gly Tyr Ala Tyr He Leu Thr His Pro Gly 
305 310 315 320 

20 

He Pro Cys He Phe Tyr Asp His Phe Phe Asn Trp Gly Phe Lys Asp 

325 330 335 



25 Gin He Ala Ala Leu Val Ala He Arg Lys Arg Asn Gly He Thr Ala 

340 345 350 



Thr Ser Ala Leu Lys He Leu Met His Glu Gly Asp Ala Tyr Val Ala 
30 355 360 365 



Glu lie Asp Gly Lys Val Val Val Lys He Gly Ser Arg Tyr Asp Val 
370 375 380 

35 

Gly Ala Val He Pro Ala Gly Phe Val Thr Ser Ala His Gly Asn Asp 
385 390 395 400 

40 

Tyr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 

405 410 415 



45 <210> 89 

<211> 1401 

< 2 1 2 > DNA 

<213> Artificial Sequence 
50 <220> 

<223> Trichoderma reesei S-Glucosidase 2 



<220> 

55 <221> CDS 

<222> (1) . . (1401) 

<223> Trichoderma reesei S-Glucosidase 2 
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<400> 89 

atg ttg ccc aag gac ttt cag tgg ggg ttc gcc acg get gec tac cag 48 
Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 

ate gag ggc gcc gtc gac cag gac ggc cgc ggc ccc age ate tgg gac 96 
He Glu Gly Ala Val Asp Gin Asp Gly Arg Gly Pro Ser He Trp Asp 

20 25 30 

acg ttc tgc gcg cag ccc ggc aag ate gcc gac ggc teg teg ggc gtg 144 
Thr Phe Cys Ala Gin Pro Gly Lys He Ala Asp Gly Ser Ser Gly Val 
35 40 45 

15 acg gcg tgc gac teg tac aac cgc acg gcc gag gac att gcg ctg ctg 192 
Thr Ala Cys Asp Ser Tyr Asn Arg Thr Ala Glu Asp He Ala Leu Leu 
50 55 60 

aag teg etc ggg gcc aag age tac cgc ttc tec ate teg tgg teg cgc 240 
20 Lys Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser He Ser Trp Ser Arg 
65 70 75 80 

ate ate ccc gag ggc ggc cgc ggc gat gcc gtc aac cag gcg ggc ate 288 
He He Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly lie 
25 85 90 95 

gac cac tac gtc aag ttc gtc gac gac ccg etc gac gcc ggc ate acg 336 

Asp His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly He Thr 

100 105 110 

30 

ccc ttc ate acc etc ttc cac tgg gac ctg ccc gag ggc ctg cat cag 384 

Pro Phe He Thr Leu Phe His Trp Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 

35 egg tac ggg ggg ctg ctg aac cgc acc gag ttc ccg etc gac ttt gaa 432 
Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 135 140 

aac tac gcc cgc gtc atg ttc agg gcg ctg ccc aag gtg cgc aac tgg 480 
40 Asn Tyr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 
145 150 155 160 

ate acc ttc aac gag ccg ctg tgc teg gcc ate ccg ggc tac ggc tec 528 
He Thr Phe Asn Glu Pro Leu Cys Ser Ala He Pro Gly Tyr Gly Ser 
45 165 170 175 

ggc acc ttc gcc ccc ggc egg cag age acc teg gag ccg tgg acc gtc 576 

Gly Thr Phe Ala Pro Gly Arg Gin Ser Thr Ser Glu Pro Trp Thr Val 

180 185 190 

50 

ggc cac aac ate etc gtc gcc cac ggc cgc gcc gtc aag gcg tac cgc 624 

Gly His Asn He Leu Val Ala His Gly Arg Ala Val Lys Ala Tyr Arg 
195 200 205 

55 gac gac ttc aag ccc gcc age ggc gac ggc cag ate ggc ate gtc etc 672 
Asp Asp Phe Lys Pro Ala Ser Gly Asp Gly Gin He Gly He Val Leu 
210 215 220 
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aac ggc gac ttc acc tac ccc tgg gac gcc gcc gac ccg gcc gac aag 720 

Asn Gly Asp Phe Thr Tyr Pro Trp Asp Ala Ala Asp Pro Ala Asp Lys 

225 230 235 240 

5 

gag gcg gcc gag egg cgc etc gag ttc ttc acg gcc tgg ttc gcg gac 768 

Glu Ala Ala Glu Arg Arg Leu Glu Phe Phe Thr Ala Trp Phe Ala Asp 

245 250 255 

10 ccc ate tac ttg ggc gac tac ccg gcg teg atg cgc aag cag ctg ggc 816 

Pro He Tyr Leu Gly Asp Tyr Pro Ala Ser Met Arg Lys Gin Leu Gly 

260 265 270 

gac egg ctg ccg acc ttt acg ccc gag gag cgc gcc etc gtc cac ggc 864 

15 Asp Arg Leu Pro Thr Phe Thr Pro Glu Glu Arg Ala Leu Val His Gly 

275 280 285 

tec aac gac ttt tac ggc atg aac cac tac acg tec aac tac ate cgc 912 

Ser Asn Asp Phe Tyr Gly Met Asn His Tyr Thr Ser Asn Tyr He Arg 

20 290 295 300 

cac cgc age teg ccc gcc tec gcc gac gac acc gtc ggc aac gtc gac 960 

His Arg Ser Ser Pro Ala Ser Ala Asp Asp Thr Val Gly Asn Val Asp 

305 310 315 320 

25 

gtg etc ttc acc aac aag cag ggc aac tgc ate ggc ccc gag acg cag 1008 

Val Leu Phe Thr Asn Lys Gin Gly Asn Cys He Gly Pro Glu Thr Gin 

325 330 335 

30 tec ccc tgg ctg cgc ccc tgt gcc gcc ggc ttc cgc gac ttc ctg gtg 1056 

Ser Pro Trp Leu Arg Pro Cys Ala Ala Gly Phe Arg Asp Phe Leu Val 

340 345 350 

tgg ate age aag agg tac ggc tac ccg ccc ate tac gtg acg gag aac 1104 

35 Trp He Ser Lys Arg Tyr Gly Tyr Pro Pro He Tyr Val Thr Glu Asn 

355 360 365 

ggc acg age ate aag ggc gag age gac ttg ccc aag gag aag att etc 1152 

Gly Thr Ser He Lys Gly Glu Ser Asp Leu Pro Lys Glu Lys He Leu 

40 370 375 380 

gaa gat gac ttc agg gtc aag tac tat aac gag tac ate cgt gcc atg 1200 

Glu Asp Asp Phe Arg Val Lys Tyr Tyr Asn Glu Tyr He Arg Ala Met 

385 390 395 400 

45 

gtt acc gcc gtg gag ctg gac ggg gtc aac gtc aag ggg tac ttt gcc 1248 

Val Thr Ala Val Glu Leu Asp Gly Val Asn Val Lys Gly Tyr Phe Ala 

405 410 415 

50 tgg teg etc atg gac aac ttt gag tgg gcg gac ggc tac gtg acg agg 1296 

Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Asp Gly Tyr Val Thr Arg 

420 425 430 

ttt ggg gtt acg tat gtg gat tat gag aat ggg cag aag egg ttc ccc 1344 

55 Phe Gly Val Thr Tyr Val Asp Tyr Glu Asn Gly Gin Lys Arg Phe Pro 

435 440 445 
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aag aag age gca aag age ttg aag ccg ctg ttt gac gag ctg att gcg 1392 
Lys Lys Ser Ala Lys Ser Leu Lys Pro Leu Phe Asp Glu Leu lie Ala 
450 455 460 

gcg gcg tga 140 i 
Ala Ala 



465 




<210> 


90 


<211> 


466 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223 > 


Synthetic Construct 


<400> 


90 



Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 



He Glu Gly Ala Val Asp Gin Asp Gly Arg Gly Pro Ser lie Trp Asp 

20 25 30 



Thr Phe Cys Ala Gin Pro Gly Lys He Ala Asp Gly Ser Ser Gly Val 
35 40 45 



Thr Ala Cys Asp Ser Tyr Asn Arg Thr Ala Glu Asp He Ala Leu Leu 
50 55 60 



Lys Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser He Ser Trp Ser Arg 
65 70 75 80 



He He Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly He 

85 90 95 



Asp His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly He Thr 

100 105 110 



Pro Phe He Thr Leu Phe His Trp Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 



Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 135 140 



Asn Tyr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 
145 150 155 160 
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He Thr Phe 

5 

Gly Thr Phe 



10 

Gly His Asn 
195 



15 Asp Asp Phe 
210 



Asn Gly Asp 
20 225 



Glu Ala Ala 

25 

Pro He Tyr 

30 

Asp Arg Leu 
275 

35 Ser Asn Asp 
290 



His Arg Ser 
40 305 



val Leu Phe 

45 

Ser Pro Trp 

50 

Trp He Ser 
355 



55 Gly Thr Ser 
370 



Asn Glu Pro Leu 
165 



Ala Pro Gly Arg 
180 



He Leu Val Ala 



Lys Pro Ala Ser 

215 



Phe Thr Tyr Pro 
230 



Glu Arg Arg Leu 
245 



Leu Gly Asp Tyr 
260 



Pro Thr Phe Thr 



Phe Tyr Gly Met 

295 



Ser Pro Ala Ser 
310 



Thr Asn Lys Gin 
325 



Leu Arg Pro Cys 

340 



Lys Arg Tyr Gly 



lie Lys Gly Glu 

375 



Cys Ser Ala He Pro 
170 



Gin Ser Thr Ser Glu 
185 



His Gly Arg Ala Val 
200 



Gly Asp Gly Gin He 

220 



Trp Asp Ala Ala Asp 

235 



Glu Phe Phe Thr Ala 
250 



Pro Ala Ser Met Arg 
265 



Pro Glu Glu Arg Ala 
280 



Asn His Tyr Thr Ser 

300 



Ala Asp Asp Thr Val 

315 



Gly Asn Cys He Gly 
330 



Ala Ala Gly Phe Arg 
345 



Tyr Pro Pro He Tyr 
360 



Ser Asp Leu Pro Lys 

380 



Gly Tyr Gly Ser 
175 



Pro Trp Thr Val 
190 



Lys Ala Tyr Arg 
205 



Gly He Val Leu 



Pro Ala Asp Lys 

240 



Trp Phe Ala Asp 
255 



Lys Gin Leu Gly 
270 



Leu Val His Gly 
285 



Asn Tyr lie Arg 



Gly Asn Val Asp 

320 



Pro Glu Thr Gin 
335 



Asp Phe Leu Val 
350 



Val Thr Glu Asn 
365 



Glu Lys He Leu 
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5 



20 



25 



30 



Glu Asp Asp Phe Arg Val Lys Tyr Tyr Asn Glu Tyr lie Arg Ala Met 
385 390 395 400 



Val Thr Ala Val Glu Leu Asp Gly Val Asn Val Lys Gly Tyr Phe Ala 

405 410 415 



10 Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Asp Gly Tyr Val Thr Arg 

420 425 430 



Phe Gly Val Thr Tyr Val Asp Tyr Glu Asn Gly Gin Lys Arg Phe Pro 
15 435 440 445 



Lys Lys Ser Ala Lys Ser Leu Lys Pro Leu Phe Asp Glu Leu He Ala 
450 455 460 



Ala Ala 
465 



<210> 91 

<211> 2103 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Trichoderma reesei fc-Glucosidase D 



48 



35 <220> 

<221> CDS 
<222> (1) . . (2103) 

<223> Trichoderma reesei G-Glucosidase D 

40 <400> 91 

atg att etc ggc tgt gaa age aca ggt gtc ate tct gec gtc aaa cac 
Met He Leu Gly Cys Glu Ser Thr Gly Val He Ser Ala Val Lys His 
15 10 15 

« 

45 ttt gtc gec aac gac cag gag cac gag egg cga gcg gtc gac tgt etc 
Phe Val Ala Asn Asp Gin Glu His Glu Arg Arg Ala Val Asp Cys Leu 

20 25 30 

ate acc cag egg get etc egg gag gtc tat ctg cga ccc ttc cag ate 
50 lie Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin He 

35 40 45 

gta gec cga gat gca agg ccc ggc gca ttg atg aca tec tac aac aag 192 
Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 
55 50 55 60 



96 



144 



itc aat ggc aag cac gtc get gac age gec gag ttc ctt cag ggc att 



240 
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Val Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly He 

65 70 75 80 

etc egg act gag tgg aat tgg gac cct etc att gtc age gac tgg tac 288 

5 Leu Arg Thr Glu Trp Asn Trp Asp Pro Leu He Val Ser Asp Trp Tyr 

85 90 95 

ggc acc tac ace act att gat gee ate aaa gee ggc ctt gat etc gag 336 

Gly Thr Tyr Thr Thr He Asp Ala He Lys Ala Gly Leu Asp Leu Glu 

10 100 105 110 

atg ccg ggc gtt tea cga tat cgc ggc aaa tac ate gag tct get ctg 384 

Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr He Glu Ser Ala Leu 

115 120 125 

15 

cag gec cgt ttg ctg aag cag tec act ate gat gag cgc get cgc cgc 432 

Gin Ala Arg Leu Leu Lys Gin Ser Thr He Asp Glu Arg Ala Arg Arg 

130 135 140 

20 gtg etc agg ttc gec cag aag gee age cat etc aag gtc tec gag gta 480 

Val Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 

145 150 155 160 

gag caa ggc cgt gac ttc cca gag gat cgc gtc etc aac cgt cag ate 528 

25 Glu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin He 

165 170 175 

tgc ggc age age att gtc eta ctg aag aat gag aac tec ate tta cct 576 

Cys Gly Ser Ser lie Val Leu Leu Lys Asn Glu Asn Ser He Leu Pro 

30 180 185 190 

etc ccc aag tec gtc aag aag gtc gee ctt gtt ggt tec cac gtg cgt 624 

Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 

195 200 205 

35 

eta ccg get ate teg gga gga ggc age gec tct ctt gtc cct tac tat 672 

Leu Pro Ala He Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 

210 215 220 

40 gec ata tct eta tac gat gee gtc tct gag gta eta gee ggt gee acg 720 

Ala He Ser Leu Tyr Asp Ala Val Ser Glu Val Leu Ala Gly Ala Thr 

225 230 235 240 

ate acg cac gag gtc ggt gee tat gee cac caa atg ctg ccc gtc ate 768 

45 He Thr His Glu Val Gly Ala Tyr Ala His Gin Met Leu Pro Val He 

245 250 255 

gac gca atg ate age aac gec gta ate cac ttc tac aac gac ccc ate 816 

Asp Ala Met He Ser Asn Ala Val He His Phe Tyr Asn Asp Pro He 

50 260 265 270 

gat gtc aaa gac aga aag etc ctt ggc agt gag aac gta teg teg aca 864 

Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 

275 280 285 

55 

teg ttc cag etc atg gat tac aac aac ate cca acg etc aac aag gee 912 

Ser Phe Gin Leu Met Asp Tyr Asn Asn He Pro Thr Leu Asn Lys Ala 



275 
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290 295 300 

atg ttc tgg ggt act etc gtg ggc gag ttt ate cct acc gec acg gga 960 

Met Phe Trp Gly Thr Leu Val Gly Glu Phe He Pro Thr Ala Thr Gly 

305 310 315 320 

att tgg gaa ttt ggc etc agt gtc ttt ggc act gec gac ctt tat att 1008 

He Trp Glu Phe Gly Leu Ser Val Phe Gly Thr Ala Asp Leu Tyr He 

325 330 335 

gat aat gag etc gtg att gaa aat aca aca cat cag acg cgt gga acc 1056 

Asp Asn Glu Leu Val He Glu Asn Thr Thr His Gin Thr Arg Gly Thr 

340 345 350 

15 gec ttt ttc gga aag gga acg acg gaa aaa gtc get acc agg agg atg 1104 

Ala Phe Phe Gly Lys Gly Thr Thr Glu Lys Val Ala Thr Arg Arg Met 

355 360 365 

gtg gec ggc age acc tac aag ctg cgt etc gag ttt ggg tct gec aac 1152 

20 Val Ala Gly Ser Thr Tyr Lys Leu Arg Leu Glu Phe Gly Ser Ala Asn 

370 375 380 

acg acc aag atg gag acg acc ggt gtt gtc aac ttt ggc ggc ggt gec 1200 

Thr Thr Lys Met Glu Thr Thr Gly Val Val Asn Phe Gly Gly Gly Ala 

25 385 390 395 400 

gta cac ctg ggt gec tgt etc aag gtc gac cca cag gag atg att gcg 1248 

Val His Leu Gly Ala Cys Leu Lys Val Asp Pro Gin Glu Met He Ala 

405 410 415 

30 

egg gec gtc aag gee gca gec gat gec gac tac acc ate ate tgc acg 1296 

Arg Ala Val Lys Ala Ala Ala Asp Ala Asp Tyr Thr He He Cys Thr 

420 425 430 

35 gga etc age ggc gag tgg gag tct gag ggt ttt gac egg cct cac atg 1344 

Gly Leu Ser Gly Glu Trp Glu Ser Glu Gly Phe Asp Arg Pro His Met 

435 440 445 

gac ctg ccc cct ggt gtg gac acc atg ate teg caa gtt ctt gac gec 1392 

40 Asp Leu Pro Pro Gly Val Asp Thr Met He Ser Gin Val Leu Asp Ala 

450 455 460 

get ccc aat get gta gtc gtc aac cag tea ggc acc cca gtg aca atg 1440 

Ala Pro Asn Ala Val Val Val Asn Gin Ser Gly Thr Pro Val Thr Met 

45 465 470 475 480 

age tgg get cat aaa gca aag gee att gtg cag get tgg tat ggt ggt 1488 

Ser Trp Ala His Lys Ala Lys Ala lie Val Gin Ala Trp Tyr Gly Gly 

485 490 495 

50 

aac gag aca ggc cac gga ate tec gat gtg etc ttt ggc aac gtc aac 1536 

Asn Glu Thr Gly His Gly lie Ser Asp Val Leu Phe Gly Asn Val Asn 

500 505 510 

55 ccg teg ggg aaa etc tec eta teg tgg cca gtc gat gtg aag cac aac 1584 

Pro Ser Gly Lys Leu Ser Leu Ser Trp Pro Val Asp Val Lys His Asn 

515 ' 520 525 
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cca gca tat etc aac tac gee age gtt ggt gga egg gtc ttg tat ggc 1632 

Pro Ala Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 

530 535 540 

5 

gag gat gtt tac gtt ggc tac aag ttc tac gac aaa acg gag agg gag 1680 

Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 
545 550 555 560 

10 gtt ctg ttt cct ttt ggg cat ggc ctg tct tac get acc ttc aag etc 1728 
Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 

cca gat tct acc gtg agg acg gtc ccc gaa acc ttc cac ccg gac cag 1776 
15 Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 585 590 

ccc aca gta gec att gtc aag ate aag aac acg age agt gtc ccg ggc 1824 
Fro Thr Val Ala lie Val Lys lie Lys Asn Thr Ser Ser Val Pro Gly 
20 595 600 605 

gec cag gtc ctg cag tta tac att teg gec cca aac teg cct aca cat 1872 

Ala Gin Val Leu Gin Leu Tyr lie Ser Ala Pro Asn Ser Pro Thr His 
610 615 620 

25 

cgc ccg gtc aag gag ctg cac gga ttc gaa aag gtg tat ctt gaa get 

Arg Pro Val Lys Glu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 

625 630 635 640 

30 ggc gag gag aag gag gta caa ata ccc att gac cag tac get act age 
Gly Glu Glu Lys Glu Val Gin He Pro He Asp Gin Tyr Ala Thr Ser 

645 650 655 



gat gtg ctt gta gga ttc teg agt cag gaa ate teg ggc aag ggg aag 
Asp Val Leu Val Gly Phe Ser Ser Gin Glu He Ser Gly Lys Gly Lys 
40 675 680 685 

ctg att gtg cct gaa acg cga ttc tgg atg ggg ctg tag 
Leu He Val Pro Glu Thr Arg Phe Trp Met Gly Leu 
690 695 700 

45 

<210> 92 
<211> 700 
c212> PRT 
50 <213> Artificial Sequence 

<220> 

<223> Synthetic Construct 
55 <400> 92 

Met lie Leu Gly Cys Glu Ser Thr Gly Val lie Ser Ala Val Lys His 



1920 



1968 



ttc tgg gac gag att gag age atg tgg aag age gag agg ggc att tat 2016 
35 Phe Trp Asp Glu He Glu Ser Met Trp Lys Ser Glu Arg Gly He Tyr 

660 665 670 



2064 



2103 
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10 15 



Phe Val Ala Asn Asp Gin Glu His Glu Arg Arg Ala Val Asp Cys Leu 
5 20 25 30 



He Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin He 
35 40 45 

10 

Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 
50 55 60 

15 

Val Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly He 
65 70 75 80 



20 Leu Arg Thr Glu Trp Asn Trp Asp Pro Leu He Val Ser Asp Trp Tyr 

85 90 95 



Gly Thr Tyr Thr Thr He Asp Ala lie Lys Ala Gly Leu Asp Leu Glu 
25 100 105 110 



Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr He Glu Ser Ala Leu 
115 120 125 

30 

Gin Ala Arg Leu Leu Lys Gin Ser Thr He Asp Glu Arg Ala Arg Arg 
130 135 140 

35 

Val Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 
145 150 155 160 



40 Glu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin lie 

165 170 175 



Cys Gly Ser Ser lie Val Leu Leu Lys Asn Glu Asn Ser He Leu Pro 
45 180 185 190 



Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 
1*95 200 205 

50 

Leu Pro Ala He Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 
210 215 220 

55 

Ala He Ser Leu Tyr Asp Ala Val Ser Glu Val Leu Ala Gly Ala Thr 
225 230 235 240 
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He Thr His Glu Val Gly Ala Tyr Ala His Gin Met Leu Pro Val He 

245 250 255 

5 

Asp Ala Met He Ser Asn Ala Val He His Phe Tyr Asn Asp Pro He 

260 265 270 

10 

Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 
275 280 285 



15 Ser Phe Gin Leu Met Asp Tyr Asn Asn He Pro Thr Leu Asn Lys Ala 
290 295 300 



Met Phe Trp Gly Thr Leu Val Gly Glu Phe He Pro Thr Ala Thr Gly 
20 305 310 315 320 



lie Trp Glu Phe Gly Leu Ser Val Phe Gly Thr Ala Asp Leu Tyr He 

325 330 335 

25 

Asp Asn Glu Leu Val He Glu Asn Thr Thr His Gin Thr Arg Gly Thr 

340 345 350 

30 

Ala Phe Phe Gly Lys Gly Thr Thr Glu Lys Val Ala Thr Arg Arg Met 
355 ' 360 365 



35 Val Ala Gly Ser Thr Tyr Lys Leu Arg Leu Glu Phe Gly Ser Ala Asn 
370 375 380 



Thr Thr Lys Met Glu Thr Thr Gly Val Val Asn Phe Gly Gly Gly Ala 
40 385 390 395 400 



Val His Leu Gly Ala Cys Leu Lys Val Asp Pro Gin Glu Met He Ala 

405 410 415 

45 

Arg Ala Val Lys Ala Ala Ala Asp Ala Asp Tyr Thr He He Cys Thr 

420 425 430 

50 

Gly Leu Ser Gly Glu Trp Glu Ser Glu Gly Phe Asp Arg Pro His Met 
435 440 445 



55 Asp Leu Pro Pro Gly Val Asp Thr Met He Ser Gin Val Leu Asp Ala 
450 455 460 
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Ala Pro Asn Ala Val Val Val Asn Gin Ser Gly Thr Pro Val Thr Met 
465 470 475 480 

5 

Ser Trp Ala His Lys Ala Lys Ala lie Val Gin Ala Trp Tyr Gly Gly 

485 490 495 



10 Asn Glu Thr Gly His Gly He Ser Asp Val Leu Phe Gly Asn Val Asn 

500 505 510 



Pro Ser Gly Lys Leu Ser Leu Ser Trp Pro Val Asp Val Lys His Asn 
15 515 520 525- 



Pro Ala Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 
530 535 540 

20 

Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 
545 550 555 560 

25 

Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 



30 Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 585 590 



Pro Thr Val Ala He Val Lys He Lys Asn Thr Ser Ser Val Pro Gly 
35 595 600 605 



Ala Gin Val Leu Gin Leu Tyr He Ser Ala Pro Asn Ser Pro Thr His 
610 615 620 

40 

Arg Pro Val Lys Glu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 
625 630 635 640 

45 

Gly Glu Glu Lys Glu Val Gin He Pro He Asp Gin Tyr Ala Thr Ser 

645 650 655 



50 Phe Trp Asp Glu He Glu Ser Met Trp Lys Ser Glu Arg Gly lie Tyr 

660 665 670 



Asp Val Leu Val Gly Phe Ser Ser Gin Glu He Ser Gly Lys Gly Lys 
55 675 680 685 
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Leu lie Val Pro Glu Thr Arg Phe Trp Met Gly Leu 
690 695 700 



5 <210> 93 

<211> 1496 

<212> DNA 

<213> Artificial Sequence 



10 <220> 

<223> Maize optimized CBHI 



15 



25 



35 



45 



<400> 93 



55 



tgcagtccgc 


ctgcaccctc 


cagtccgaga 


cccacccgcc 


gctcacctgg 


cagaagtget 


60 


cctccggcgg 


cacctgcacc 


cagcagaccg 


gctccgtggt 


gatcgacgcc 


aactggeget 


120 


ggacccacgc 


caccaactcc 


tccaccaact 


getacgaegg 


caacacctgg 


tcctccaccc 


180 


tctgcccgga 


caacgagacc 


tgegecaaga 


actgctgcct 


cgacggcgcc 


gcctacgcct 


240 


ccacctacgg 


eg tgaccacc 


tccggcaact 


ccctctccat 


eggcttegtg 


acccagtccg 


300 


cccagaagaa 


cgtgggcgcc 


cgcctctacc 


tcatggcctc 


cgacaccacc 


taccaggagt 


360 


tcaccctcct 


eggcaacgag 


ttctcct teg 


acgtggacgt 


gtcccagctc 


ccgtgcggcc 


420 


tcaacggcgc 


cctctacttc 


gtgtccatgg 


acgccgacgg 


cggcgtgtcc 


aagtacccga 


480 


ccaacaccgc 


cggcgccaag 


tacggcaccg 


getactgega 


ctcccagtgc 


ccgcgcgacc 


540 


tcaagttcat 


caacggccag 


gccaacgtgg 


agggctggga 


gccgtcctcc 


aacaacgcca 


600 


acaccggcat 


cggcggccac 


ggctcctgct 


gctccgagat 


ggacatctgg 


gaggecaact 


660 


/-< V- f» /*% ^ f-H ~\ 

CCdtCtCCya 


ft f* ^ f* f zi f~* 








atctacaaaa 


720 


gcgacggctg 


cggcggcacc 


tactccgaca 


accgctacgg 


cggcacctgc 


gacccggacg 


780 


gctgcgactg 


gaacccgtac 


cgcctcggca 


acacctcctt 


ctacggcccg 


ggctcctcct 


840 


tcaccctcga 


caccaccaag 


aagctcaccg 


tggtgaccca 


gttcgagacc 


tccggcgcca 


900 


tcaaccgcta 


ctacgtgcag 


aacggcgtga 


ccttccagca 


gccgaacgcc 


gagctegget 


960 


cctactccgg 


caacgagctc 


aacgacgact 


actgcaccgc 


egaggaggee 


gagttcggcg 


1020 


gctcctcctt 


ctccgacaag 


ggcggcctca 


cccagttcaa 


gaaggccacc 


tccggcggca 


1080 


tggtgctcgt 


gatgtccctc 


tgggacgact 


actacgccaa 


catgetctgg 


ctcgactcca 


1140 


cctacccgac 


caacgagacc 


tcctccaccc 


cgggcgccgt 


gcgcggctcc 


tgctccacct 


1200 


cctccggcgt 


gccggcccag 


gtggagtccc 


agtccccgaa 


cgccaaggtg 


accttctcca 


1260 


acatcaagtt 


cggcccgacc 


ggctccaccg 


gcaacccgtc 


cggcggcaac 


ccgccgggcg 


1320 
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gcaacccgcc gggcaccacc accacccgcc 
gcccgaccca gtcccactac ggccagtgcg 
gcgcctccgg caccacctgc caggtgctca 

<210> 94 

<211> 1365 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223 > Maize optimized CBHII 



<400> 94 

atggtgccgc 


tcgaggagcg 


ccaggcctgc 


aactggtccg 


gcccgacctg 


ctgcgcctcc 


tactcccagt 


gcctcccggg 


cgccgcctcc 


acctcccgcg 


tgtccccgac 


cacctcccgc 


accaccaccc 


gcgtgccgcc 


ggtgggctcc 


gtgggcgtga 


ccccgtgggc 


caacgcctac 


ccgtccctca 


ccggcgccat 


ggccaccgcc 


atgtggctcg 


acaccctcga 


caagaccccg 


accgccaaca 


agaacggcgg 


caactacgcc 


cgcgactgcg 


ccgccc tcgc 


c tccaacggc 


aagtacaaga 


actacatcga 


caccatccgc 


accctcctcg 


tgatcgagcc 


ggactccctc 


aagtgcgcca 


acgcccagtc 


cgcctacctc 


aacctcccga 


acgtggccat 


gtacctcgac 


gccaaccagg 


acccggccgc 


ccagctcttc 


cgcgccctcc 


gcggcctcgc 


caccaacgtg 


ccgccgtcct 


acacccaggg 


caacgccgtg 


ggcccgctcc 


tcgccaacca 


cggctggtcc 


tccggcaagc 


agccgaccgg 


ccagcagcag 


ggct tcggca 


tccgcccgtc 


cgccaacacc 


gtgaagccgg 


gcggcgagtg 


cgacggcacc 



PCT/US2004/007182 

gcccggccac caccaccggc tcctccccgg 13 80 
gcggcatcgg ctactccggc ccgaccgtgt 1440 
acccgtacta ctcccagtgc ctctag 1496 



tcctccgtgt ggggccagtg cggcggccag 60 

ggctccacct gcgtgtactc caacgactac 120 

tcctcctcct ccacccgcgc cgcctccacc 180 

tcctcctccg ccaccccgcc gccgggctcc 240 

ggcaccgcca cctactccgg caacccgttc 300 

tacgcctccg aggtgtcctc cctcgccatc 360 

gccgccgccg tggccaaggt gccgtccttc 420 

ctcatggagc agaccctcgc cgacatccgc 4 80 

ggccagttcg tggtgtacga cctcccggac 540 

gagtactcca tcgccgacgg cggcgtggcc 600 

cagatcgtgg tggagtactc cgacatccgc 660 

gccaacctcg tgaccaacct cggcaccccg 720 

gagtgcatca actacgccgt gacccagctc 780 

gccggccacg ccggctggct cggctggccg 84 0 

gccaacgtgt acaagaacgc ctcctccccg 900 

gccaactaca acggctggaa catcacctcc 960 

tacaacgaga agctctacat ccacgccatc 1020 

aacgccttct tcatcaccga ccagggccgc 1080 

tggggcgact ggtgcaacgt gatcggcacc 1140 

ggcgactccc tcctcgactc cttcgtgtgg 1200 

tccgactcct ccgccccgcg cttcgactcc 1260 
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5 



2} 



35 



45 



cactgcgccc tcccggacgc cctccagccg gccccgcagg ccggcgcctg gttccaggcc 1320 
tacttcgtgc agctcctcac caacgccaac ccgtccttcc tctag 1365 



<210> 95 

<211> 1317 

<212> DNA 

10 <213> Artificial Sequence 

<220> 

<223 > Maize optimized EG LI 

15 <400> 95 



55 



atgcagcagc 


cgggcacctc 


caccccggag 


gtgcacccga 


agctcaccac 


ctacaagtgc 


60 


accaagtccg 


gcggctgcgt 


ggcccaggac 


acctccgtgg 


tgctcgactg 


gaactaccgc 


120 


tggatgcacg 


acgccaacta 


caactcctgc 


accgtgaacg 


gcggcgtgaa 


caccaccctc 


180 


tgcccggacg 


aggccacctg 


cggcaagaac 


tgcttcatcg 


agggcgtgga 


ctacgccgcc 


240 


cccggcgtga 


ccacctccgg 


ctcctccctc 


accatgaacc 


agtacatgcc 


gtcctcctcc 


300 


ggcggctact 


cctccgtgtc 


cccgcgcctc 


tacctcctcg 


actccgacgg 


cgagtacgtg 


360 


atgctcaagc 


tcaacggcca 


ggagctctcc 


ttcgacgtgg 


acctctccgc 


cctcccgtgc 


420 


ggcgagaacg 


gctccctcta 


cctctcccag 


atggacgaga 


acggcggcgc 


caaccagtac 


480 


aacaccgccg 


gcgccaacta 


cggctccggc 


tactgcgacg 


cccagtgccc 


ggtgcagacc 


540 


cggcgcaacg 






caciaacttct 


actocaaccia 


qatcrqacatc 


600 


ctcgagggca 


actcccgcgc 


caacgccctc 


accccgcact 


cctgcaccgc 


caccgcctgc 


660 


gactccgccg 


gctgcggctt 


caacccgtac 


ggctccggct 


acaagtccta 


ctacggcccg 


720 


ggcgacaccg 


tggacacctc 


caagaccttc 


accatcatca 


cccagttcaa 


caccgacaac 


780 


ggctccccgt 


ccggcaacct 


cgtgtccatc 


acccgcaagt 


accagcagaa 


cggcgtggac 


840 


atcccgtccg 


cccagccggg 


cggcgacacc 


atctcctcct 


gcccgtccgc 


ctccgcctac 


900 


ggcggcctcg 


ccaccatggg 


caaggccctc 


tcctccggca 


tggtgctcgt 


gttctccatc 


960 


tggaacgaca 


actcccagta 


catgaactgg 


ctcgactccg 


gcaacgccgg 


cccgtgctcc 


1020 


tccaccgagg 


gcaacccgtc 


caacaccctc 


gccaacaacc 


cgaacaccca 


cgtggtgttc 


1080 


tccaacatcc 


gctggggcga 


catcggctcc 


accaccaact 


ccaccgcccc 


gccgccgccg 


1140 


ccggcctcct 


ccaccacctt 


ctccaccacc 


cgccgctcct 


ccaccacctc 


ctcctccccg 


1200 


tcctgcaccc 


agacccactg 


gggccagtgc 


ggcggcatcg 


gctactccgg 


ctgcaagacc 


1260 
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tgcacctccg gcaccacctg ccagtactcc aacgactact actcccagtg cctctag 1317 



<210> 96 

5 <211> 1401 

<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> Maize optimized BGLII 



<400> 96 





atgctcccga 


aggacttcca 


gtggggcttc 


gccaccgccg 


cctaccagat 


cgagggcgcc 


60 


15 


gtggaccagg 


acggccgcgg 


cccgtccatc 


tgggacacct 


tctgcgccca 


geegggcaag 


120 




atcgccgacg 


gctcctccgg 


cgtgaccgcc 


tgcgactcct 


acaaccgcac 


cgccgaggac 


180 


20 


atcgccctcc 


tcaagtccct 


cggcgccaag 


tcctaccgct 


tctccatctc 


ctggtcccgc 


240 


atcatcccgg 


agggeggecg 


cggcgacgcc 


gtgaaccagg 


ccggcatcga 


ccactacgtg 


300 




aagttcgtgg 


acgacctcct 


cgacgccggc 


atcaccccgt 


tcatcaccct 


cttccactgg 


360 


25 


gacctcccgg 


agggcctcca 


ccagcgctac 


ggcggcctcc 


tcaaccgcac 


cgagttcccg 


420 




ctcgact teg 


agaactaege 


ccgcgtgatg 


ttccgcgccc 


tcccgaaggt 


gcgcaactgg 


480 


30 


atcaccttca 


acgagccgct 


ctgctccgcc 


atcccgggct 


acggctccgg 


caccttcgcc 


540 


ccgggccgcc 


agtccacctc 


cgagccgtgg 


accgtgggcc 


acaacatcct 


cgtggcccac 


600 




ggccgcgccg 


tgaaggecta 


ccgcgacgac 


ttcaagcegg 


cctccggcga 


cggccagatc 


660 


35 


ggcatcgtgc 


teaaeggega 


cttcacctac 


ccgtgggacg 


ccgccgaccc 


ggccgacaag 


720 




gaggccgccg 


agcgccgcct 


cgagttcttc 


accgcctggt 


tcgccgaccc 


gatctacctc 


780 


40 


ggcgactacc 


cggcctccat 


gcgcaagcag 


ctcggcgacc 


gcctcccgac 


cttcaccccg 


840 


gaggagegeg 


ccctcgtgca 


cggctccaac 


gacttctacg 


gcatgaacca 


ctacacctcc 


900 




aactacatcc 


gccaccgctc 


ctccccggcc 


tccgccgacg 


acaccgtggg 


caacgtggac 


960 


45 


gtgetcttea 


ccaacaagca 


gggcaactgc 


atcggcccgg 


agacccagtc 


cccgtggctc 


1020 




cgcccgtgcg 


ccgccggctt 


ccgcgacttc 


ctcgtgtgga 


tctccaagcg 


ctacggctac 


1080 


50 


ccgccgatct 


aegtgacega 


gaacggcacc 


tccatcaagg 


gegagtcega 


cctcccgaag 


1140 


gagaagatcc 


tcgaggacga 


cttccgcgtg 


aagtactaca 


acgagtacat 


ccgcgccatg 


1200 




gtgaccgccg 


tggagctcga 


eggegtgaac 


gtgaagggct 


acttcgcctg 


gtccctcatg 


1260 


55 


gacaacttcg 


agtgggccga 


eggctaegtg 


acccgcttcg 


gcgtgaccta 


cgtggactac 


1320 




gagaaeggee 


agaagegett 


cccgaagaag 


tccgccaagt 


ccctcaagcc 


gctcttcgac 


1380 
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gagctcatcg ccgccgccta g 1401 



5 <210> 97 

<2U> 2103 

<212> DNA 

< 2 1 3 > Artificial Sequence 

10 <220> 

<223> Maize optimized CEL3D 



<400> 97 

atgatcctcg gctgcgagtc caccggcgtg atctccgccg tgaagcactt cgtggccaac 60 

15 

gaccaggagc acgagcgccg cgccgtggac tgcctcatca cccagcgcgc cctccgcgag 120 

gtgtacctcc gcccgttcca gatcgtggcc cgcgacgccc gcccgggcgc cctcatgacc 180 

20 tcctacaaca aggtgaacgg caagcacgtg gccgactccg ccgagttcct ccagggcatc 240 



ctccgcaccg agtggaactg ggacccgctc atcgtgtccg actggtacgg cacctacacc 300 



accatcgacg ccatcaaggc cggcctcgac ctcgagatgc cgggcgtgtc ccgctaccgc 360 

25 

ggcaagtaca tcgagtccgc cctccaggcc cgcctcctca agcagtccac catcgacgag 420 



cgcgcccgcc gcgtgctccg cttcgcccag aaggcctccc acctcaaggt gtccgaggtg 480 
30 gagcagggcc gcgacttccc ggaggaccgc gtgctcaacc gccagatctg cggctcctcc 54 0 



atcgtgctcc tcaagaacga gaactccatc ctcccgctcc cgaagtccgt gaagaaggtg 600 



gccctcgtgg gctcccacgt gcgcctcccg gccatctccg gcggcggctc cgcctccctc 660 

35 

gcgccgtact acgccatccc cctctacgac gccgtgtccg aggtgctcgc cggcgccacc 720 



atcacccacg aggtgggcgc ctacgcccac cagatgctcc cggtgatcga cgccatgatc 780 
40 tccaacgccg tgatccactt ctacaacgac ccgatcgacg tgaaggaccg caagctcctc 840 



ggctccgaga acgtgtcctc cacctccttc cagctcatgg actacaacaa catcccgacc 900 



ctcaacaagg ccatgttctg gggcaccctc gtgggcgagt tcatcccgac cgccaccggc 960 

45 

atccgggagt tcggcctctc cgtgttcggc accgccgacc tctacatcga caacgagctc 1020 



gtgatcgaga acaccaccca ccagacccgc ggcaccgcct tcttcggcaa gggcaccacc 1080 
50 gagaaggtgg ccacccgccg catggtggcc ggctccacct acaagctccg cctcgagttc 1140 



ggctccgcca acaccaccaa gatggagacc accggcgtgg tgaacttcgg cggcggcgcc 1200 



gtgcacctcg gcgcctgcct caaggtggac ccgcaggaga tgatcgcccg cgccgtgaag 1260 

55 

gccgccgccg acgccgacta caccatcatc tgcaccggcc tctccggcga gtgggagtcc 1320 
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gagggcttcg 


accgcccgca 


catggacctc 




gtgctcgacg 


ccgccccgaa 


cgccgtggtg 


5 


tcccgggccc 


acaaggccaa 


ggccatcgtg 




cacggcatct 


ccgacgtgct 


cttcggcaac 


10 


tggccggtgg 
gtgctctacg 


acgtgaagca 
gcgaggacgt 


caacccggcc 
gtacgtgggc 




gtgctcttcc 


cgttcggcca 


cggcctctcc 


15 


gtgcgcaccg 


tgccggagac 


cttccacccg 




aagaacacct 


cctccgtgcc 


gggcgcccag 


20 


tccccgaccc 


accgcccggt 
^yy yy vy v,a 


gaaggagctc 




a tcgagtcca 


tgtggaagtc 


cgagcgcggc 


25 


caggagat ct 
tag 


ccggcaaggg 


caagctcatc 


30 


<210> 98 
<211> 420 
<212> DNA 
<213> zea 


mays 




35 


<220> 

<223> Q protein promoter 


40 


<400> 98 
gggctggtaa 

ctagttgtca 


attacttggg 
caagttgtat 


agcaatggta 
atcgattcgt 




acaagtaaca 


caatattaca 


aagttagttt 


45 


tgacatgtaa 


agtgaagctt 


attatacttc 




acaaaggtcc 


aaaaatccac 


atcaaccatt 


50 


attatccaac 
agaaattgtg 


aaatgtttat 
ccaacaaagc 


caatgtggta 
tataagtaac 



<210> 99 

55 <211> 1188 

<212> DNA 

<213> artificial sequence 



PCT/US2004/007182 

ccgccgggcg tggacaccat gatctcccag 1380 

gtgaaccagt ccggcacccc ggtgaccatg 1440 

caggcctggt acggcggcaa cgagaccggc 1500 

gtgaacccgt ccggcaagct ctccctctcc 1560 

tacctcaact acgcctccgt gggcggccgc 1620 

tacaagttct acgacaagac cgagcgcgag 1680 

tacgccacct tcaagctccc ggactccacc 1740 

gaccagccga ccgtggccat cgtgaagatc 1800 

gtgctccagc tctacatctc cgccccgaac 1860 

cacggcttcg agaaggtgta cctcgaggcc 1920 

gaccagtacg ccacctcctt ctgggacgag 1980 

atctacgacg tgctcgtggg cttctcctcc 2040 

gtgccggaga cccgcttctg gatgggcctc 2100 

2103 



tgcaaatcct ttgcatgtac gcaaaactag 60 

cgcgtttcaa caactcatgc aacattacaa 120 

catacaaagc aagaaaagga caataatact 160 

ctaatccaac acaaaacaaa aaaaagttgc 240 

aacctatacg taaagtgagt gatgagtcac 300 

tcatacaagc attgacatcc cataaatgca 360 

cctcatatgt atttgcactc atgcatcaca 420 
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<220> 





<223> synthetic ferulic acid esterase 








5 


<400> 99 
atgyccycct 






tccaac taca 


accaggtgcg 


caaeggegtg 


60 




ccgcgcggcc 


aggeggegaa 


LdLL ^v>v> tat 




ccaccaactc 


cacccgcccg 


120 


10 


gcccgcgtgc 


a o p f - r* n r* r* 
c* v t» ILu^y 


nncir hflrtrc 


oaM >j o c* c* y o 


agtactccgt 


gctctacctc 


180 




c tccacggca 


t eggeggee c 


cydydatydt 


wyy t ttydyy 


gcggcggccg 


cgccaacgtg 


240 


15 


accgccgaca 


acc c ca uege 


c g°ggy 


altaayttyt 


teatcategt 


gaccccgaac 


300 


accaacgccg 


ccggcccggg 


Catcgccydc 




act tcaccaa 


ggacctcctc 


360 




aactccc tea 


t cccgtacac 


cgag ccca a c 


laCCCCytyL 


acaccgaccg 


cgagcaccgc 


420 


20 


geca tegecg 


gect ccct at 


/-^ j-m r** /^t ^ 


tdy ItL L LtOl 


acatcggcct 


caccaacctc 


480 




gacaagt t eg 


cc taca tegg 


cccgacctcc 


yccyccutya 


acacctaccc 


gaacgagege 


540 


25 


ctct tecegg 


aeggeggcaa 


ggccgcccgc 


gagaagecca 


agctcctctt 


catcgcctgc 


600 


ggcaccaacg 


ac tccc tea t 


egge t cegge 


cagcgcgcgc 


acgagtactg 


cgtggccaac 


660 




aaca tcaacc 


aegtg tactg 


gc ucacccag 


ggcggcggcc 


acgacttcaa 


cgtgtggaag 


720 


30 


ccgggcc tc t 


ggaac t tec t 


ccagacggcc 


gaegaggecg 


gcctcacccg 


cgacggcaac 


780 




accccgg tgc 


cgaccccg uc 


^ ^ ^ ^1 a ^ /™T 

cccyadgccy 


y LtddLdLLt 


gcatcgaggc 


cgaggactac 


840 


1 s 


gaeggea tea 


actcctcctc 


catcgagatc 


atcggcgtgc 


cgccggaggg 


cggccgcggc 


900 


ateggctaca 


tcacctccgg 


cgactacctc 


gtgtacaagt 


ccatcgactt 


cggcaacggc 


960 




gccacctcct 


teaaggecaa 


ggtggccaac 


gccaacacct 


ccaacatcga 


gcttcgcctc 


1020 


40 


aacggcccga 


acggcaccct 


catcggcacc 


ctctccgtga 


agtccaccgg 


cgactggaac 


1080 




acctacgagg 


agcagacctg 


ctccatctcc 


aaggtgaccg 


gcatcaacga 


cctctacctc 


1140 




gtgt tcaagg 


gcccggtgaa 


catcgactgg 


ttcaccttcg 


gcgtgtag 




1188 



45 

<210> 100 

<211> 395 

<212> PRT 

50 <213> artificial sequence 

<220> 

<223> synthetic ferulic acid esterase 

55 <400> 100 

Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val 
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15 10 15 



Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser 
5 20 25 30 



Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly 
35 40 45 

10 

Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly lie 
50 55 60 

15 

Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val 
65 70 75 80 



20 He Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro Leu He He 

85 90 95 



Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr 
25 100 105 110 



Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr lie Glu 
115 120 125 

30 

Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly 
130 135 140 

35 

Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu 
145 150 155 160 



40 Asp Lys Phe Ala Tyr lie Gly Pro He Ser Ala Ala Pro Asn Thr Tyr 

165 170 175 



Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys 
45 180 185 190 



Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly 
195 200 205 

50 

Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn He Asn His 
210 215 220 

55 

Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys 
225 230 235 240 
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5 



10 



25 



30 



55 



Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 

245 250 255 



Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 



Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser Ser Ser He 
275 280 285 



15 Glu lie He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr He 
290 295 300 



Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly 
20 305 310 315 320 



Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He 

325 330 335 



Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly Thr Leu Ser 

340 345 350 



Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser 
355 360 365 



35 


He Ser 


Lys Val Thr Gly He 




370 


375 




Pro Val 


Asn lie Asp Trp Phe 


40 


385 


390 




<210> 


101 




<211> 


1188 


45 


<212> 


DNA 




<213> 


artificial sequence 




<220> 






<223> 


plasmid 13036 


50 








<400> 


101 



380 



395 



atggccgcct ccctcccgac catgccgccg tccggctacg accaggtgcg caacggcgtg 60 
ccgcgcggcc aggtggtgaa catctcctac ttctccaccg ccaccaactc cacccgcccg 120 
gcccgcgtgt acctcccgcc gggctactcc aaggacaaga agtactccgt gctctacctc 180 
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ctccacggca 


tcggcggctc 


cgagaacgac 


tggttcgagg 


gcggcggccg 


cgccaacgtg 


240 


atcgccgaca 


acctcatcgc 


cgagggcaag 


atcaagccgc 


tcatcatcgt 


gaccccgaac 


300 


accaacgccg 


ccggcccggg 


catcgccgac 


ggctacgaga 


acttcaccaa 


ggacctcctc 


360 


aactccctca 


tcccgtacat 


cgagtccaac 


tactccgtgt 


acaccgaccg 


cgagcaccgc 


420 


gccatcgccg 


gcctctctat 


gggcggcggc 


cagtccttca 


acatcggcct 


caccaacctc 

• 


480 


gacaagttcg 


cctacatcgg 


cccgatctcc 


gccgccccga 


acacctaccc 


gaacgagcgc 


540 


ctct tcccgg 


acggcggcaa 


ggccgcccgc 


gagaagctca 


agctcctctt 


caccgcctgc 


600 


ggcaccaacg 


actccctcat 


cggcttcggc 


cagcgcgtgc 


acgagtactg 


cgtggccaac 


660 


aaca tcaacc 


acgtgtactg 


gctcatccag 


ggcggcggcc 


acgacttcaa 


cgtgtggaag 


720 


ccgggcctct 


ggaacttcct 


ccagatggcc 


gacgaggccg 


gcctcacccg 


cgacggcaac 


780 


accccggtgc 


cgaccccgtc 


cccgaagccg 


gccaacaccc 


gcatcgaggc 


cgaggactac 


840 


gacggcatca 


actcctcctc 


catcgagatc 


atcggcgtgc 


cgccggaggg 


cggccgcggc 


900 


atcggctaca 


tcacctccgg 


cgactacctc 


gtgtacaagt 


ccatcgactt 


cggcaacggc 


960 


gccacctcct 


tcaaggccaa 


ggtggccaac 


gccaacacct 


ccaacatcga 


gcttcgcctc 


1020 


aacggcccga 


acggcaccct 


catcggcacc 


ctctccgtga 


agtccaccgg 


cgactggaac 


1080 


acctacgagg 


agcagacctg 


ctccatctcc 


aaggtgaccg 


gcatcaacga 


cctctacctc 


1140 


gtgt tcaagg 


gcccggtgaa 


catcgactgg 


ttcaccttcg 


gcgtgtag 




1188 



<210> 102 
<211> 395 
<212> PRT 

<213> artificial sequence 
<220> 

c223> plasmid 13036 
<400> 102 

Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val 
15 10 15 

Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn He Ser Tyr Phe Ser 

20 25 30 

Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly 
35 40 45 
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Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly lie 
50 55 60 



5 Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val 
65 70 75 80 



lie Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro Leu He He 
10 85 90 95 



Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr 

100 105 HO 

15 

Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr He Glu 
115 120 125 

20 

Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly 
130 135 140 



25 Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu 
145 150 155 160 



Asp Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro Asn Thr Tyr 
30 165 170 175 



Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys 

180 185 190 

35 

Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly 
195 200 205 

40 

Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn He Asn His 
210 215 220 



45 val Tyr Trp Leu lie Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys 
225 230 235 240 



Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 
50 245 250 255 



Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 

55 

Thr Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser Ser Ser lie 
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275 280 285 



Glu He He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr lie 
5 290 295 300 



Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly 
305 310 315 320 

10 

Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He 

325 330 335 

15 

Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly Thr Leu Ser 

340 345 350 



20 Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser 

355 360 365 



lie Ser Lys Val Thr Gly lie Asn Asp Leu Tyr Leu Val Phe Lys Gly 
25 370 375 380 



30 



Pro Val Asn He Asp Trp Phe Thr Phe Gly Val 
385 390 395 



<210> 103 

<21b 1245 

<212> DNA 

35 <213> artificial sequence 

<220> 

<223> plasmid 13038 

40 <400> 103 





atgagggtgt 


tgctcgttgc 


cctcgctctc 


ctggctctcg 


ctgcgagcgc 


cacctccatg 


60 




gccgcctccc 


tcccgaccat 


gccgccgtcc 


ggctacgacc 


aggcgcgcaa 


cggcgtgccg 


120 


45 


cgcggccagg 


tggtgaacat 


ctcctacttc 


tccaccgcca 


ccaactccac 


ccgcccggcc 


180 




cgcgtgtacc 


tcccgccggg 


ctactccaag 


gacaagaagc 


actccgtgct 


ctacctcctc 


240 


50 


cacggcatcg 


gcggctccga 


gaacgactgg 


ttcgagggcg 


gcggccgcgc 


caacgtgatc 


300 


gccgacaacc 


tcaccgccga 


gggcaagacc 


aagccgctca 


ccatcgtgac 


cccgaacacc 


360 




aacgccgccg 


gcccgggcat 


cgccgacggc 


tacgagaacc 


tcaccaagga 


cctcctcaac 


420 


55 


tccctcatcc 


cgtacatcga 


gtccaactac 


tccgtgtaca 


ccgaccgcga 


gcaccgcgcc 


480 




atcgccggcc 


tctctatggg 


cggcggccag 


cccttcaaca 


tcggcctcac 


caacctcgac 


540 
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aagttcgcct 


acatcggccc 


gatctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcctc 


600 


ttcccggacg 


gcggcaaggc 


cgcccgcgag 


aagctcaagc 


tcctcttcat 


cgcctgcggc 


660 


accaacgact 


ccctcatcgg 


cttcggccag 


cgcgtgcacg 


agcactgcgt 


ggccaacaac 


720 


atcaaccacg 


tgtactggct 


catccagggc 


ggcggccacg 


acttcaacgt 


gtggaagccg 


780 


ggcctctgga 


acttcctcca 


gatggccgac 


gaggccggcc 


tcacccgcga 


cggcaapacc 


840 


ccggtgccga 


ccccgtcccc 


gaagccggcc 


aacacccgca 


tcgaggccga 


ggactacgac 


900 


ggcatcaact 


cctcctccat 


cgagatcatc 


ggcgtgccgc 


cggagggcgg 


ccgcggcatc 


960 


QQCtacatca 

W V* ^* W 


cctccqocoa 

^» w ™2 ~j 


CtacCtCQtq 


tacaaa tcca 


t cQac t 1 coa 


caacaQCQcc 

™^ 


1020 


acctccttca 


aggccaaggt 


ggccaacgcc 


aacacctcca 


acatcgagct 


tcgcctcaac 


1080 


ggcccgaacg 


gcaccctca t 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1140 


tacgaggagc 


agacctgctc 


catctccaag 


gtgaccggca 


tcaacgacct 


ctacctcgtg 


1200 


ttcaagggcc 


cggtgaacat 


cgactggt tc 


accttcggcg 


tgtag 




1245 



<210> 104 
<211> 414 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13038 aa 
<400> 104 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 



Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 



Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser 
35 40 45 



Tyr Phe Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 



Pro Pro Gly Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 
65 70 75 80 



His Gly lie Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 
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85 



90 



95 



Ala Asn Val lie Ala Asp Asn Leu He Ala Glu Gly Lys lie Lys Pro 
5 100 105 110 



10 



15 



Leu He He Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala 
115 120 125 



Asp Gly Tyr Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro 
130 135 140 



Tyr lie Glu Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala 
145 150 155 160 



20 He Ala Gly Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu 

165 170 175 



Thr Asn Leu Asp Lys Phe Ala Tyr He Gly Pro lie Ser Ala Ala Pro 
25 180 185 190 



Asn Thr Tyr Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala 
195 200 205 



30 



35 



Arg Glu Lys Leu Lys Leu Leu Phe lie Ala Cys Gly Thr Asn Asp Ser 
210 215 220 



Leu He Gly Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn 
225 230 235 240 



40 He Asn His Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn 

245 250 255 



Val Trp Lys Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala 
45 260 265 270 



Gly Leu Thr Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys 
275 280 285 



50 



Pro Ala Asn Thr Arg lie Glu Ala Glu Asp Tyr Asp Gly He Asn Ser 
290 295 300 



55 



Ser Ser lie Glu lie lie Gly Val Pro Pro Glu Gly Gly Arg Gly lie 
305 310 315 320 
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Gly Tyr lie Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser lie Asp Phe 

325 330 335 

5 

Gly Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 

10 

Ser Asn lie Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly 
355 360 365 



15 Thr Leu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 
370 375 380 



Thr Cys Ser lie Ser Lys Val Thr Gly lie Asn Asp Leu Tyr Leu Val 
20 385 390 395 400 



Phe Lys Gly Pro Val Asn He Asp Trp Phe Thr Phe Gly Val 

405 410 

25 

<210> 105 
<211> 1425 
<212> DNA 

30 <213> artificial sequence 

<220> 

<223> plasmid 13039 

35 <400> 105 



45 



55 



atgctggcgg 


ctctggccac 


gtcgcagctc 


gtcgcaacgc 


gcgccggcct 


gggcgtcccg 


60 


gacgcgtcca 


cgt tccgccg 


cggcgccgcg 


cagggcctga 


ggggggcccg 


ggcgtcggcg 


120 


gcggcggaca 


cgctcagcat 


gcggaccagc 


gcgcgcgcgg 


cgcccaggca 


ccagcaccag 


180 


caggcgcgcc 


gcggggccag 


gttcccgtcg 


ctcgtcgtgt 


gcgccagcgc 


cggcgccatg 


240 


gccgcctccc 


tcccgaccat 


gccgccgtcc 


ggctacgacc 


aggtgcgcaa 


cggcgtgccg 


300 


cgcggccagg 


tggtgaacat 


ctcctacttc 


tccaccgcca 


ccaactccac 


ccgcccggcc 


360 


cgcgtgtacc 


tcccgccggg 


ctactccaag 


gacaagaagt 


actccgtgct 


ctacctcctc 


420 


cacggcatcg 


gcggctccga 


gaacgactgg 


ttcgagggcg 


gcggccgcgc 


caacgtgatc 


480 


gccgacaacc 


tcatcgccga 


gggcaagatc 


aagccgctca 


tcatcgtgac 


cccgaacacc 


540 


aacgccgccg 


gcccgggcat 


cgccgacggc 


tacgagaact 


tcaccaagga 


cctcctcaac 


600 


tccctcatcc 


cgtacatcga 


gtccaaccac 


tccgtgtaca 


ccgaccgcga 


gcaccgcgcc 


660 
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atcgccggcc 


tctctatggg 


cggcggccag 


tccttcaaca 


tcggcctcac 


caacctcgac 


720 




aagttcgcct 


acatcggccc 


gatctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcctc 


780 


5 


ttcccggacg 


geggcaagge 


cgcccgcgag 


aagctcaagc 


tcctcttcat 


cgcctgcggc 


840 




accaacgact 


ccctcatcgg 


cttcggccag 


cgcgtgcacg 


agtactgcgt 


ggccaacaac 


900 


10 


atcaaccacg 


tgtactggct 


catccagggc 


ggcggccacg 


acttcaacgt 


gtggaagccg 


960 


ggcctctgga 


acttcctcca 


gatggecgae 


gaggccggcc 


tcacccgcga 


cggcaacacc 


1020 




ccggtgccga 


ccccgtcccc 


gaagccggcc 


aacacccgca 


tegaggcega 


ggactacgac 


1080 


15 


ggcatcaact 


cctcctccat 


cgagatcatc 


ggcgtgccgc 


eggagggegg 


* 

ccgcggcatc 


1140 




ggctacatca 


cctccggcga 


ctacctcgtg 


tacaagtcca 


tegacttegg 


caacggcgcc 


1200 


20 


acctcct tea 


aggecaaggt 


ggccaacgcc 


aacacctcca 


acatcgagct 


tcgcctcaac 


1260 


ggcccgaacg 


gcaccctcat 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1320 




tacgaggagc 


agacctgc tc 


catctccaag 


gtgaceggea 


tcaacgacct 


ctacctcgtg 


1380 


25 


ttcaagggee 


eggtgaacat 


cgactggttc 


accttcggcg 


tgtag 




1425 



<210> 106 

<211> 474 

30 <212> PRT 

<213> artificial sequence 



35 



40 



45 



<220> 

<223> plasmid 13039 aa 
c400> 106 

Met Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
15 10 15 



Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 

20 25 30 



Leu Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
35 40 45 



50 Thr Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 
50 55 60 



Gly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala Met 
55 65 70 75 80 
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Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val Arg 

85 90 95 



5 Asn Gly Val Pro Arg Gly Gin Val Val Asn He Ser Tyr Phe Ser Thr 

100 105 HO 



Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly Tyr 
10 115 120 125 



Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly He Gly 
130 135 140 



15 



Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val He 
145 150 155 160 



20 



Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro Leu He He Val 

165 170 175 



25 Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr Glu 

180 185 190 



Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr He Glu Ser 
30 195 * 200 205 



Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly Leu 
210 215 220 



35 



Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu Asp 
225 230 235 240 



40 



Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro Asn Thr Tyr Pro 

245 250 255 



45 Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys Leu 

260 265 270 



Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly Phe 
50 275 280 285 



55 



Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn He Asn His Val 
2 90 295 3 00 



Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys Pro 
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305 310 315 320 



Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr Arg 
5 * 325 330 335 



10 



15 



30 



35 



Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn Thr 

340 345 350 



Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser Ser Ser He Glu 
355 360 365 



He He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr He Thr 
370 375 380 



20 Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly Ala 
385 390 395 400 



Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He Glu 
25 405 410 415 



Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly Thr Leu Ser Val 

420 425 430 



Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser He 
435 440 445 



Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val Phe Lys Gly Pro 
450 455 460 



40 Val Asn He Asp Trp Phe Thr Phe Gly Val 
465 470 



<210> 107 
45 <211> 1263 
<212> DNA 

<213> artificial sequence 
<220> 

50 <223> plasmid 13347 
<400> 107 

atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc cacctccatg 60 
55 gccgcctccc tcccgaccat gccgccgtcc ggctacgacc aggtgcgcaa cggcgtgccg 120 
cgcggccagg tggtgaacat ctcctacttc tccaccgcca ccaactccac ccgcccggcc 180 
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cgcgtgtacc tcccgccggg ctactccaag gacaagaagt actccgtgct ctacctcctc 240 

cacggcatcg gcggctccga gaacgactgg ttcgagggcg gcggccgcgc caacgtgatc 300 

gccgacaacc tcatcgccga gggcaagatc aagccgctca tcatcgtgac cccgaacacc 360 

aacgccgccg gcccgggcat cgccgacggc tacgagaact tcaccaagga cctcctcaac 420 

tccctcatcc cgtacatcga gtccaactac tccgtgtaca ccgaccgcga gcaccgcgcc 480 

atcgccggcc tctctatggg cggcggccag tccttcaaca tcggcctcac caacctcgac 540 

aagttcgcct acatcggccc gatctccgcc gccccgaaca cctacccgaa cgagcgcctc 600 

ttcccggacg gcggcaaggc cgcccgcgag aagctcaagc tcctcttcat cgcctgcggc 660 

accaacgact ccctcatcgg cttcggccag cgcgtgcacg agtactgcgt ggccaacaac 720 

atcaaccacg tgtactggct catccagggc ggcggccacg acttcaacgt gtggaagccg 780 

ggcctctgga acttcctcca gatggccgac gaggccggcc tcacccgcga cggcaacacc 840 

ccggtgccga ccccgtcccc gaagccggcc aacacccgca tcgaggccga ggactacgac 900 

ggcatcaact cctcctccat cgagatcatc ggcgtgccgc cggagggcgg ccgcggcatc 960 

ggctacatca cctccggcga ctacctcgtg tacaagtcca tcgacttcgg caacggcgcc 1020 

acctccttca aggccaaggt ggccaacgcc aacacctcca acatcgagct tcgcctcaac 1080 

ggcccgaacg gcaccctcat cggcaccctc tccgtgaagt ccaccggcga ctggaacacc 1140 

tacgaggagc agacctgctc catctccaag gtgaccggca tcaacgacct ctacctcgtg 1200 

ttcaagggcc cggtgaacat cgactggttc accttcggcg tgtccgagaa ggacgaactc 1260 

tag 12 63 

<210> 108 
<211> 420 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13347 
<400> 108 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 



Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 
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Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser 
35 40 45 

5 

Tyr Phe Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 



10 Pro Pro Gly Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 
65 70 75 80 



His Gly He Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 
15 85 90 95 



Ala Asn Val He Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro 

100 105 110 

20 

Leu He He Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala 
115 120 125 

25 

Asp Gly Tyr Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro 
130 135 140 



30 Tyr He Glu Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala 
145 150 155 160 



He Ala Gly Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu 
35 165 170 175 



Thr Asn Leu Asp Lys Phe Ala Tyr lie Gly Pro He Ser Ala Ala Pro 

180 185 190 

40 

Asn Thr Tyr Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala 
195 200 205 

45 

Arg Glu Lys Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser 
210 215 220 



50 Leu He Gly Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn 
225 230 235 240 



He Asn His Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn 
55 245 250 255 
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Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser 
35 40 45 

5 

Tyr Phe Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 



10 Pro Pro Gly Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 
65 70 75 80 



His Gly He Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 
15 85 90 95 



Ala Asn Val He Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro 

100 105 110 

20 

Leu He He Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala 
115 120 125 

25 

Asp Gly Tyr Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro 
130 135 140 



30 Tyr lie Glu Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala 
145 150 155 160 



lie Ala Gly Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu 
35 165 170 175 



Thr Asn Leu Asp Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro 

180 185 190 

40 

Asn Thr Tyr Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala 
195 200 205 

45 

Arg Glu Lys Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser 
210 215 220 



50 Leu He Gly Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn 
225 230 235 240 



He Asn His Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn 
55 245 250 255 
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Val Trp Lys Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala 

260 265 270 



Gly Leu Thr Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys 
275 280 285 



Pro Ala Asn Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser 
290 295 300 



Ser Ser He Glu He Tie Gly Val Pro Pro Glu Gly Gly Arg Gly He 
305 310 315 320 



Gly Tyr He Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe 

325 330 335 



Gly Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 



Ser Asn lie Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly 
355 360 365 



Thr Leu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 
370 375 380 



Thr Cys Ser He Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val 
385 390 395 400 



Phe Lys Gly Pro Val Asn He Asp Trp Phe Thr Phe Gly Val Ser Glu 

405 410 415 



Lys Asp Glu Leu 

420 



<210> 109 
<211> 1296 
<212> DNA 

<213> artificial sequence 
<220> 

<223> plasmid 11267 
<400> 109 

atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 60 
gcgcagtccg agccggagct gaagctggag tccgtggtga tcgtgtcccg ccacggcgtg 120 
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cgcgccccga ccaaggccac ccagctcatg caggacgtga ccccggacgc ctggccgacc 180 

tggccggtga agctcggcga gctgaccccg cgcggcggcg agctgatcgc ctacctcggc 240 

cactactggc gccagcgcct cgtggccgac ggcctcctcc cgaagtgcgg ctgcccgcag 300 

tccggccagg tggccatcat cgccgacgtg gacgagcgca cccgcaagac cggcgaggcc 360 

ttcgccgccg gcctcgcccc ggactgcgcc atcaccgtgc acacccaggc cgacacctcc 420 

tccccggacc cgctcttcaa cccgctcaag accggcgtgt gccagctcga caacgccaac 480 

gtgaccgacg ccatcctgga gcgcgccggc ggctccatcg ccgacttcac cggccactac 540 

cagaccgcct tccgcgagct ggagcgcgtg ctcaacttcc cgcagtccaa cctctgcctc 600 

aagcgcgaga agcaggacga gtcctgctcc ctcacccagg ccctcccgtc cgagctgaag 660 

gtgtccgccg actgcgtgtc cctcaccggc gccgtgtccc tcgcctccat gctcaccgaa 720 

atcttcctcc tccagcaggc ccagggcatg ccggagccgg gctggggccg catcaccgac 780 

tcccaccagt ggaacaccct cctctccctc cacaacgccc agttcgacct cctccagcgc 840 

accccggagg tggcccgctc ccgcgccacc ccgctcctcg acctcatcaa gaccgccctc 900 

accccgcacc cgccgcagaa gcaggcctac ggcgtgaccc tcccgacctc cgtgctcttc 960 

atcgccggcc acgacaccaa cctcgccaac ctcggcggcg ccctggagct gaactggacc 1020 

ctcccgggcc agccggacaa caccccgccg ggcggcgagc tggtgttcga gcgctggcgc 1080 

cgcctctccg acaactccca gtggattcag gtgtccctcg tgttccagac cctccagcag 1140 

atgcgcgaca agaccccgct ctccctcaac accccgccgg gcgaggtgaa gctcaccctc 1200 

gccggctgcg aggagcgcaa cgcccagggc atgtgctccc tcgccggctt cacccagatc 1260 

gtgaacgagg cccgcatccc ggcctgctcc ctctaa 1296 

<210> 110 
<211> 431 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 11267 aa sequence 
<400> 110 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 



Ala Thr Ser Ala Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 

20 25 30 
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Val He Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 
35 40 45 



Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
50 55 60 



Leu Gly Glu Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu Gly 
65 70 75 80 



15 His Tyr Trp Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Pro Lys Cys 

85 90 95 



Gly Cys Pro Gin Ser Gly Gin Val Ala He lie Ala Asp Val Asp Glu 
20 100 105 HO 



Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro Asp 
115 120 125 



25 



Cys Ala He Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp Pro 
130 135 140 



30 



Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala Asn 
145 150 155 160 



35 val Thr Asp Ala Tie Leu Glu Arg Ala Gly Gly Ser He Ala Asp Phe 

165 170 175 



Thr Gly His Tyr Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu Asn 
40 180 185 190 



Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu Ser 
195 200 205 



45 



Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala Asp 
210 215 220 



50 



Cys Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr Glu 
225 230 235 240 



55 lie Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp Gly 

245 250 255 
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Arg lie Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His Asn 

260 265 270 

5 

Ala Gin Phe Asp Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser Arg 
275 280 285 

10 Ala Thr Pro Leu Leu Asp Leu lie Lys Thr Ala Leu Thr Pro His Pro 
290 295 300 

Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu Phe 
15 305 310 315 320 

He Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 

325 330 335 

20 

Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 

25 

Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 

30 He Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
370 375 380 

Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 
35 385 390 395 400 

Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

405 410 415 

40 

Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu 

420 425 430 

45 

<210> 111 
<2H> 1314 
<212> DNA 

<213> artificial sequence 

50 

<220> 

<223> plasmid 11268 
<400> 111 

55 atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 60 
gcgcagtccg agccggagct gaagctggag tccgtggtga tcgtgtcccg ccacggcgtg 120 
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15 



20 



25 



30 



35 



50 



55 



cgcgccccga 


ccaaggccac 


ccagctcatg 


caggacgtga 


ccccggacgc 


ctggccgacc 


tggccggtga 


agctcggcga 


gctgaccccg 


cgcggcggcg 


agctgatcgc 


ctacctcggc 


cactactggc 


gccagcgcct 


cgtggccgac 


ggcctcctcc 


cgaagtgcgg 


ctgcccgcag 


tccggccagg 


tggccatcat 


cgccgacgtg 


gacgagcgca 


cccgcaagac 


cggcgaggcc 


ttcgccgccg 


gcctcgcccc 


ggactgcgcc 


atcaccgtgc 


acacccaggc 


cgacacctcc 


tccccggacc 


cgctcttcaa 


cccgct caag 


accggcgtgt 


gccagctcga 


caacgccaac 


gtgaccgacg 


ccatcctgga 


gcgcgccggc 


ggctccatcg 


ccgacttcac 


cggccactac 


cagaccgcct 


tccgcgagct 


ggagcgcgtg 


ctcaacttcc 


cgcagtccaa 


cctctgcctc 


aagcgcgaga 


agcaggacga 


gtcctgctcc 


ctcacccagg 


ccctcccgtc 


cgagctgaag 


gtgtccgccg 


actgcgtgtc 


cctcaccggc 


gccgtgtccc 


tcgcctccat 


gctcaccgaa 


atcttcctcc 


tccagcaggc 


ccagggcatg 


ccggagccgg 


gctggggccg 


catcaccgac 


tcccaccagt 


ggaacaccct 


cctctccctc 


cacaacgccc 


agttcgacct 


cctccagcgc 


accccggagg 


tggcccgctc 


ccgcgccacc 


ccgctcctcg 


acctcatcaa 


gaccgccctc 


accccgcacc 


cgccgcagaa 


gcaggcctac 


ggcgtgaccc 


tcccgacctc 


cgtgctcttc 


atcgccggcc 


acgacaccaa 


cctcgccaac 


ctcggcggcg 


ccctggagct 


gaactggacc 


cccccgggcc 


agccggaCaa 




yy ^yy^y d y L 




acactaacoc 


cgcctctccg 


acaactccca 


gtggattcag 


gtgtccctcg 


tgttccagac 


cctccagcag 


atgcgcgaca 


agaccccgct 


ctccctcaac 


accccgccgg 


gcgaggtgaa 


gctcaccctc 


gccggctgcg 


aggagcgcaa 


cgcccagggc 


atgtgctccc 


tcgccggctt 


cacccagatc 


gtgaacgagg 


cccgcatccc 


ggcctgctcc 


ctctccgaga 


aggacgagct 


gtaa 



40 



<210> 112 

<211> 437 

45 <212> PRT 

<213> artificial sequence 

<220> 

<223> plasmid 11268 amino acid sequence 



<400> 112 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 



Ala Thr Ser Ala Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1314 



305 



WO 2005/096804 PCT/US2004/007182 

20 25 30 



Val He Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 
5 35 " 40 45 



Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
50 55 60 

10 

Leu Gly Glu Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu Gly 
65 70 75 80 

15 

His Tyr Trp Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Pro Lys Cys 

85 90 95 



20 Gly Cys Pro Gin Ser Gly Gin Val Ala lie He Ala Asp Val Asp Glu 

100 105 110 



Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro Asp 
25 115 120 125 



Cys Ala He Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp Pro 
130 135 140 

30 

Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala Asn 
145 150 155 160 

35 

Val Thr Asp Ala He Leu Glu Arg Ala Gly Gly Ser He Ala Asp Phe 

165 170 175 



40 Thr Gly His Tyr Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu Asn 

180 185 190 



Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu Ser 
45 195 200 205 



Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala Asp 
210 215 220 

50 

Cys Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr Glu 
225 230 235 240 

55 

lie Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp Gly 

245 250 255 
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5 



10 



Arg He Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His Asn 

260 265 270 



Ala Gin Phe Asp Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser Arg 
275 280 285 



Ala Thr Pro Leu Leu Asp Leu He Lys Thr Ala Leu Thr Pro His Pro 
290 295 300 



15 Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu Phe 
305 310 315 320 



He Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 
20 325 330 335 



Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 



25 



30 



Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 



He Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
370 375 380 



35 Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 
385 390 395 400 



40 



Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

405 410 415 



45 



Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu Ser 

420 425 430 



Glu Lys Asp Glu Leu 
435 



50 



307 



